Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu, Elman Mansimov, Roger B. Grosse, Shun Liao, Jimmy Ba
–Neural Information Processing Systems
In this work, we propose to apply trust region optimization to deep reinforcement learning using a recently proposed Kronecker-factored approximation to the curvature.
Neural Information Processing Systems
Nov-21-2025, 07:08:24 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America
- Canada > Ontario
- Toronto (0.15)
- United States > California
- Los Angeles County > Long Beach (0.04)
- Canada > Ontario
- Asia > Middle East
- Industry:
- Leisure & Entertainment > Games (0.49)
- Technology: