Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization
–Neural Information Processing Systems
In this paper, we provide deeper insights into Anderson acceleration in reinforcement learning by establishing its connection with quasi-Newton methods for policy iteration and improved convergence guarantees under the assumptions that the Bellman operator is differential and non-expansive.
Neural Information Processing Systems
Oct-2-2025, 18:22:48 GMT
- Country:
- Asia > China
- Heilongjiang Province > Harbin (0.04)
- North America > Canada
- Asia > China
- Industry:
- Leisure & Entertainment > Games (0.47)
- Technology: