main_final
–Neural Information Processing Systems
Direct policy search serves as one of the workhorses in modern reinforcement learning (RL), and its applications in continuous control tasks have recently attracted increasing attention. In this work, we investigate the convergence theory of policy gradient (PG) methods for learning the linear risk-sensitive and robust controller.
Neural Information Processing Systems
Oct-2-2025, 14:26:23 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Illinois (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.46)
- Industry:
- Government (0.67)
- Technology: