Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Grudic, Gregory Z., Ungar, Lyle H.

Dec-31-2002–Neural Information Processing Systems

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state-action value function, Q.

algorithm, convergence, performance gradient

Country:
- North America > United States
  - Pennsylvania (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.05)
  - Colorado > Boulder County
    - Boulder (0.04)
  - California > San Mateo County
    - Menlo Park (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.87)
  - Representation & Reasoning > Uncertainty
    - Fuzzy Logic (0.62)
