Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments

Clary, Kaleigh, Tosch, Emma, Foley, John, Jensen, David

Apr-12-2019–arXiv.org Artificial Intelligence

Reproducibility in reinforcement learning is challenging: uncontrolled stochasticity from many sources, such as the learning algorithm, the learned policy, and the environment itself, has led researchers to report the performance of learned agents using aggregate metrics over multiple random seeds for a single environment. Unfortunately, there are still pernicious sources of variability in reinforcement learning agents that make common summary statistics an unsound measure of performance. Our experiments demonstrate the variability of common agents used in the popular OpenAI Baselines repository. We make the case for reporting post-training agent performance as a distribution, rather than a point estimate.
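To make the contrast between a point estimate and a distribution concrete, here is a minimal sketch of how per-episode returns from a trained agent might be summarized. It is not the authors' evaluation code: the function name `summarize_returns`, its parameters, and the choice of quartiles plus a percentile bootstrap interval for the mean are all assumptions made for illustration, and it uses only the Python standard library.

```python
import random
import statistics


def summarize_returns(returns, n_boot=1000, seed=0):
    """Describe a sample of episode returns as a distribution, not one number.

    Hypothetical helper for illustration; `returns` is a list of
    per-episode scores collected after training has finished.
    """
    if len(returns) < 2:
        raise ValueError("need at least two episode returns")
    rng = random.Random(seed)  # fixed seed so the bootstrap itself is repeatable
    ordered = sorted(returns)

    # Quartiles from the stdlib; 'inclusive' keeps them within the observed range.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")

    # Percentile bootstrap for the mean: resample with replacement,
    # recompute the mean each time, then read off the 2.5% and 97.5% points.
    boot_means = sorted(
        statistics.fmean(rng.choices(returns, k=len(returns)))
        for _ in range(n_boot)
    )
    ci_low = boot_means[int(0.025 * n_boot)]
    ci_high = boot_means[int(0.975 * n_boot) - 1]

    return {
        "n": len(returns),
        "min": ordered[0],
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": ordered[-1],
        "mean": statistics.fmean(returns),
        "stdev": statistics.stdev(returns),
        "mean_ci95": (ci_low, ci_high),
    }
```

Calling `summarize_returns` on a skewed sample, for instance mostly low scores with a few very high ones, would typically show a mean far from the median and a wide interval, which is the kind of information a single reported average hides.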

machine learning, reinforcement learning, variability


Country:
- North America > United States > Massachusetts > Hampshire County > Amherst (0.16)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
