Goto

Collaborating Authors

 Reinforcement Learning



Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation

Neural Information Processing Systems

In this paper, we consider the problem of determining when along a training roll-out feedback from the environment is no longer beneficial, and an intervention such as resetting the agent to the initial state distribution is warranted. We show that such interventions can naturally trade off a small sub-optimality gap for a dramatic decrease in sample complexity. In particular, we focus on the reinforcement learning setting in which the agent has access to a reward signal in addition to either (a) an expert supervisor triggering the e-stop mechanism in real-time or (b) expert state-only demonstrations used to "learn" an automatic e-stop trigger.




Model-based Adversarial Meta-Reinforcement Learning

Neural Information Processing Systems

Meta-reinforcement learning and multi-task reinforcement learning aim to improve the sample efficiency by leveraging the shared structure within a family of tasks.





A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning Arnu Pretorius InstaDeep Cape Town, South Africa Scott Cameron

Neural Information Processing Systems

Multi-agent reinforcement learning has recently shown great promise as an approach to networked system control. Arguably, one of the most difficult and important tasks for which large scale networked system control is applicable is common-pool resource management.