The Potential of the Return Distribution for Exploration in RL

Moerland, Thomas M., Broekens, Joost, Jonker, Catholijn M.

Jun-11-2018–arXiv.org Artificial Intelligence

This paper studies the potential of the return distribution for exploration in deterministic environments. We study network losses and propagation mechanisms for Gaussian, Categorical, and Mixture-of-Gaussians distributions. Combined with exploration policies that leverage this return distribution, we solve, for example, a randomized Chain task of length 100, which has not been reported before when learning with neural networks.

artificial intelligence, machine learning, reinforcement learning

Country:
- Europe
  - Denmark (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Netherlands > South Holland
    - Delft (0.04)

Genre:
- Research Report (0.84)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (0.70)
    - Neural Networks (0.68)
