Reviews: Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning

Jan-26-2025, 17:24:01 GMT–Neural Information Processing Systems

This work is an interesting contribution to deep RL that considers using Anderson acceleration to improve off-policy TD based algorithms. The approach is supported by some theory as well as experiments on standard benchmark problems. Overall, reviewers like the paper and agree it should be accepted.

off-policy deep reinforcement learning, regularized anderson acceleration

Neural Information Processing Systems

Jan-26-2025, 17:24:01 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)