Review for NeurIPS paper: Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
–Neural Information Processing Systems
The paper introduces the BIRD algorithm, a model-based RL algorithm based on differentiable planning (SVG-like). A key aspect of BIRD is a Mutual Information term in the loss function, which encourages the similarity of the imaginary data and the real observations. Reviewers generally liked this paper, even though there have been some concerns related to the extent of its novelty, especially compared to Dreamer. I summarize some of the concerns here, which should be addressed in the revised version of this work. Please refer to the reviews for more detail, and revise your paper by incorporating their comments.
Neural Information Processing Systems
Jan-25-2025, 04:35:41 GMT
- Technology: