Variance Reduced Policy Evaluation with Smooth Function Approximation
Hoi-To Wai, Mingyi Hong, Zhuoran Yang, Zhaoran Wang, Kexin Tang
–Neural Information Processing Systems
Policy evaluation with smooth and nonlinear function approximation has shown great potential for reinforcement learning. Compared to linear function approximation, it allows for using a richer class of approximation functions such as the neural networks. Traditional algorithms are based on two timescales stochastic approximation whose convergence rate is often slow.
Neural Information Processing Systems
Oct-2-2025, 20:32:37 GMT
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East > Jordan (0.04)
- North America
- Canada (0.04)
- United States
- Illinois > Cook County
- Evanston (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.28)
- New Jersey > Mercer County
- Princeton (0.04)
- Illinois > Cook County
- Asia
- Technology: