Two Timescale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal Difference Learning

Prasenjit Karmakar, Shalabh Bhatnagar

arXiv.org Artificial Intelligence 

Stochastic approximation algorithms are sequential nonparametric methods for finding a zero or a minimum of a function when only noisy observations of the function values are available. Two timescale stochastic approximation algorithms constitute one of the most general subclasses of stochastic approximation methods. These algorithms consist of two coupled recursions updated with different step sizes (one considerably smaller than the other), which in turn facilitates convergence. Two timescale stochastic approximation algorithms [19] have been successfully applied to several complex problems arising in reinforcement learning, signal processing, and admission control in communication networks. In many reinforcement learning applications (specifically, those where the value function is parameterized), non-additive Markov noise is present in one or both iterates, which requires the current two timescale framework to be extended to include Markov noise (for example, [13, p. 5] notes that generalizing the analysis to Markov noise requires the theory of two timescale stochastic approximation to cover it).
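To make the two-coupled-recursion structure concrete, here is a minimal sketch (not from the paper; the functions, step-size exponents, and noise model are illustrative assumptions). The fast iterate y is driven by a larger step size and tracks a quantity that depends on the current slow iterate x, while x moves on a slower schedule, so from x's perspective y has effectively equilibrated:

```python
import numpy as np

# Illustrative two timescale stochastic approximation sketch.
# Fast recursion: y tracks y*(x) = x/2 for the current x.
# Slow recursion: x seeks the root of x + y*(x) - 3 = 0, i.e. x* = 2, y* = 1.
rng = np.random.default_rng(0)
x, y = 5.0, 5.0
for n in range(1, 200001):
    a = 1.0 / n ** 0.6   # fast step size
    b = 1.0 / n          # slow step size; b/a -> 0 separates the timescales
    # noisy observations of the two drift functions (additive noise here,
    # in contrast to the Markov noise the paper addresses)
    y += a * ((x / 2.0 - y) + rng.normal(scale=0.1))
    x += b * (-(x + y - 3.0) + rng.normal(scale=0.1))
```

With these step sizes both conditions sum(a_n) = sum(b_n) = infinity and b_n / a_n -> 0 hold, and the coupled iterates converge to the fixed point (x, y) = (2, 1).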
