Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning
Wenjie Shi, Shiji Song, Hui Wu, Ya-Chu Hsu, Cheng Wu, Gao Huang
–Neural Information Processing Systems
Model-free deepreinforcement learning (RL)algorithms havebeenwidely used for a range of complex control tasks.
Neural Information Processing Systems
Feb-13-2026, 20:02:50 GMT
- Country:
- Asia
- China > Beijing
- Beijing (0.05)
- Middle East > Jordan (0.04)
- China > Beijing
- North America > Canada
- Asia
- Technology: