Trust Region-Guided Proximal Policy Optimization
Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan
Neural Information Processing Systems
Deep model-free reinforcement learning has achieved great success in recent years, notably in video games [11], board games [19], robotics [10], and challenging control tasks [17,5].
Feb-13-2026, 10:04:27 GMT