Divergence-Augmented Policy Optimization
Qing Wang, Yingru Li, Jiechao Xiong, Tong Zhang
–Neural Information Processing Systems
In deep reinforcement learning, policy optimization methods need to deal with issues such asfunction approximation andthereuse ofoff-policydata.
Neural Information Processing Systems
Feb-14-2026, 03:45:53 GMT
- Country:
- Asia
- China > Guangdong Province
- Middle East > Jordan (0.04)
- Europe
- Hungary > Budapest
- Budapest (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Hungary > Budapest
- North America
- Canada > British Columbia
- United States > Illinois
- Cook County > Chicago (0.04)
- Asia
- Technology: