Deep Exploration via Randomized Value Functions

Osband, Ian, Russo, Daniel, Wen, Zheng, Van Roy, Benjamin

Mar-22-2017–arXiv.org Machine Learning

We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to value function learning. We present several reinforcement learning algorithms that leverage randomized value functions and demonstrate their efficacy through computational studies. We also prove a regret bound that establishes statistical efficiency with a tabular representation.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

Mar-22-2017

arXiv.org PDF

Add feedback

Genre:
- Research Report > New Finding (0.46)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.67)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found