Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Grudic, Gregory Z., Ungar, Lyle H.

Dec-31-2002–Neural Information Processing Systems

We address two open theoretical questions in Policy Gradient Reinforcement Learning.The first concerns the efficacy of using function approximation torepresent the state action value function, .

machine learning, performance gradient, reinforcement learning, (15 more...)

Neural Information Processing Systems

Dec-31-2002

Conferences PDF

Country:
- North America > United States > Colorado (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.86)
  - Representation & Reasoning > Uncertainty
    - Fuzzy Logic (0.62)

Duplicate Docs Excel Report

Title
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found