Variance Reduced Policy Evaluation with Smooth Function Approximation

Hoi-To Wai, Mingyi Hong, Zhuoran Yang, Zhaoran Wang, Kexin Tang

Oct-2-2025, 20:32:37 GMT–Neural Information Processing Systems

Policy evaluation with smooth and nonlinear function approximation has shown great potential for reinforcement learning. Compared to linear function approximation, it allows for using a richer class of approximation functions such as the neural networks. Traditional algorithms are based on two timescales stochastic approximation whose convergence rate is often slow.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Oct-2-2025, 20:32:37 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Uncertainty
    - Fuzzy Logic (0.84)

Duplicate Docs Excel Report

Title
Variance Reduced Policy Evaluation with Smooth Function Approximation

Similar Docs Excel Report more

Title	Similarity	Source
None found