Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation

Samuel Ainsworth, Matt Barnes, Siddhartha Srinivasa

Dec-3-2019–arXiv.org Machine Learning

In many environments, only a relatively small subset of the complete state space is needed to accomplish a given task. We develop a simple technique using emergency stops (e-stops) to exploit this phenomenon. Using e-stops significantly improves sample complexity by reducing the amount of required exploration, while retaining a performance bound that efficiently trades off the rate of convergence against a small asymptotic sub-optimality gap. We analyze the regret behavior of e-stops and present empirical results in discrete and continuous settings demonstrating that our reset mechanism can provide order-of-magnitude speedups on top of existing reinforcement learning methods.
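To make the idea concrete, here is a minimal sketch of an e-stop as an environment wrapper: the episode is cut short as soon as the agent leaves an allowed set of states. Everything in it is an illustrative assumption rather than the authors' implementation: the `GridWorld` toy environment, the `EStopWrapper` and `support_of` names, the choice to build the allowed set from the states visited in a demonstration, and the zero reward on an e-stop.

```python
from typing import Iterable, List, Set, Tuple

State = Tuple[int, int]


class GridWorld:
    """Hypothetical 2-D grid, used only for illustration."""

    # action -> (dx, dy)
    MOVES = {0: (0, 1), 1: (0, -1), 2: (1, 0), 3: (-1, 0)}

    def __init__(self, size: int = 5, goal: State = (4, 4)) -> None:
        self.size = size
        self.goal = goal
        self.state: State = (0, 0)

    def reset(self) -> State:
        self.state = (0, 0)
        return self.state

    def step(self, action: int) -> Tuple[State, float, bool]:
        dx, dy = self.MOVES[action]
        # Clamp the move so the agent stays on the grid.
        x = min(max(self.state[0] + dx, 0), self.size - 1)
        y = min(max(self.state[1] + dy, 0), self.size - 1)
        self.state = (x, y)
        done = self.state == self.goal
        return self.state, (1.0 if done else 0.0), done


def support_of(trajectories: Iterable[List[State]]) -> Set[State]:
    """Collect every state visited in the given trajectories (assumed source of the allowed set)."""
    return {s for traj in trajectories for s in traj}


class EStopWrapper:
    """Ends the episode when the wrapped environment leaves the allowed states."""

    def __init__(self, env: GridWorld, allowed: Set[State], estop_reward: float = 0.0) -> None:
        self.env = env
        self.allowed = allowed
        self.estop_reward = estop_reward  # assumed value; not taken from the paper

    def reset(self) -> State:
        return self.env.reset()

    def step(self, action: int) -> Tuple[State, float, bool]:
        state, reward, done = self.env.step(action)
        if state not in self.allowed:
            # E-stop triggered: terminate now so no further exploration happens here.
            return state, self.estop_reward, True
        return state, reward, done


# One demonstration: along the bottom row, then up the right-hand column.
demo = [(x, 0) for x in range(5)] + [(4, y) for y in range(1, 5)]
env = EStopWrapper(GridWorld(), support_of([demo]))

env.reset()
off_path = env.step(0)  # moves to (0, 1), which the demonstration never visited

env.reset()
for a in [2, 2, 2, 2, 0, 0, 0, 0]:  # follow the demonstrated path
    on_path = env.step(a)
```

In this sketch a learner wrapped this way only ever sees the 9 demonstrated states out of 25, which is the kind of reduced exploration the abstract refers to; the price is that any better route outside the allowed set can no longer be found.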


