Deep Exploration via Bootstrapped DQN

Osband, Ian, Blundell, Charles, Pritzel, Alexander, Van Roy, Benjamin

Feb-14-2020, 15:26:12 GMT–Neural Information Processing Systems

Efficient exploration remains a major challenge for reinforcement learning (RL). Common dithering strategies for exploration, such as epsilon-greedy, do not carry out temporally-extended (or deep) exploration; this can lead to exponentially larger data requirements. However, most algorithms for statistically efficient RL are not computationally tractable in complex environments. Randomized value functions offer a promising approach to efficient exploration with generalization, but existing algorithms are not compatible with nonlinearly parameterized value functions. As a first step towards addressing such contexts we develop bootstrapped DQN.
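The contrast the abstract draws between dithering and deep exploration can be illustrated with a small sketch. It is not the authors' implementation. It assumes Q-value estimates are stored as plain lists indexed by action, and that a randomized value function is represented by several independently trained estimates ("heads"), one of which is sampled at the start of an episode and then followed greedily. All function names are hypothetical.

```python
import random


def greedy_action(q_values):
    """Return the index of the largest Q-value (first index on ties)."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def epsilon_greedy_action(q_values, epsilon, rng):
    """Dithering: re-randomize at every step.

    With probability epsilon take a uniformly random action,
    otherwise take the greedy action.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return greedy_action(q_values)


def sample_head(num_heads, rng):
    """Assumed scheme: pick one value-function head per episode."""
    return rng.randrange(num_heads)


def head_greedy_action(head_q_values, head):
    """Act greedily with respect to the head chosen for this episode.

    head_q_values[k][a] is head k's Q-value estimate for action a
    in the current state.
    """
    return greedy_action(head_q_values[head])


if __name__ == "__main__":
    rng = random.Random(0)
    # Two hypothetical heads that disagree about the best action.
    head_q_values = [[0.0, 1.0], [1.0, 0.0]]
    head = sample_head(len(head_q_values), rng)  # fixed for the episode
    for _ in range(3):
        # The same head is used at every step of the episode,
        # so its choices stay consistent over time.
        print(head, head_greedy_action(head_q_values, head))
```

In this sketch the randomness of epsilon-greedy is drawn independently at each step, whereas the head is drawn once and held fixed, so the agent commits to one plausible value estimate for the whole episode.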

bootstrapped dqn, deep exploration, exploration

Genre:
- Research Report (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)