Reviews: Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
–Neural Information Processing Systems
The paper presents a general-purpose control algorithm combining planning and RL to solve tasks with sparse rewards or with long horizon. This algorithm is novel and interesting. The three reviewers agree that the contributions presented here should be published at the conference. The rebuttal helped solving most clarification issues. The reviewers also suggest various ways to further improve the manuscript.
Neural Information Processing Systems
Jan-24-2025, 00:30:28 GMT
- Technology: