Reviews: Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Neural Information Processing Systems 

All reviewers recommend accepting the paper. The authors' response addressed most of the reviewers' concerns. While the AC recommends accepting the paper, the AC encourages the authors to consider the comments of Reviewer 1. Changing only the backup mechanism while keeping all other hyperparameters fixed, as in the Nature DQN model, is indeed a sound experimental setup. However, the optimal operating regime for different models might differ (even when they share architectures and training protocols): for instance, we could 'afford' a larger learning rate with a better backup mechanism.
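To make the backup-mechanism distinction concrete, here is a minimal tabular sketch (hypothetical names, not the paper's exact deep-RL algorithm): a standard one-step backup updates a single transition, whereas a backward sweep over a stored episode lets a terminal reward propagate along the whole trajectory in one pass.

```python
from collections import defaultdict

def one_step_backup(Q, s, a, r, s_next, done, alpha, gamma):
    """Standard one-step Q-learning backup from a single transition."""
    bootstrap = 0.0 if done else max(Q[s_next])
    Q[s][a] += alpha * (r + gamma * bootstrap - Q[s][a])

def episodic_backward_update(Q, episode, alpha, gamma):
    """Sweep a stored episode from its last transition back to its first,
    so the terminal reward propagates through the whole trajectory in
    a single pass instead of one step per update."""
    for (s, a, r, s_next, done) in reversed(episode):
        one_step_backup(Q, s, a, r, s_next, done, alpha, gamma)

# Tiny chain MDP: states 0..3, one action, reward 1 only at the end.
Q = defaultdict(lambda: [0.0])
episode = [(s, 0, 1.0 if s == 3 else 0.0, s + 1, s == 3) for s in range(4)]
episodic_backward_update(Q, episode, alpha=1.0, gamma=0.9)
# Q[0][0] is already ~0.729 (= gamma**3) after one backward pass;
# one-step backups applied in forward order would need several passes.
```

This also illustrates the AC's point: because the backward sweep changes how aggressively value information flows, the learning rate `alpha` that works best for it need not match the one tuned for one-step backups.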