Distributed Multitask Reinforcement Learning with Quadratic Convergence
Tutunov, Rasul, Kim, Dongho, Ammar, Haitham Bou
–Neural Information Processing Systems
Multitask reinforcement learning (MTRL) suffers from scalability issues when the number of tasks or trajectories grows large. The main reason behind this drawback is the reliance on centeralised solutions. Recent methods exploited the connection between MTRL and general consensus to propose scalable solutions. These methods, however, suffer from two drawbacks. First, they rely on predefined objectives, and, second, exhibit linear convergence guarantees. In this paper, we improve over state-of-the-art by deriving multitask reinforcement learning from a variational inference perspective. We then propose a novel distributed solver for MTRL with quadratic convergence guarantees.
Neural Information Processing Systems
Dec-31-2018
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.14)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Middlesex County
- Canada > Quebec
- Europe > United Kingdom
- Genre:
- Research Report > New Finding (0.68)
- Technology: