Taylor Expansion Policy Optimization
Tang, Yunhao, Valko, Michal, Munos, Rémi
In this work, we investigate the application of Taylor expansions in reinforcement learning. In particular, we propose Taylor expansion policy optimization, a policy optimization formalism that generalizes prior work (e.g., TRPO) as a first-order special case. We also show that Taylor expansions intimately relate to off-policy evaluation. Finally, we show that this new formulation entails modifications which improve the performance of several state-of-the-art distributed algorithms.
Mar-13-2020
- Country:
- North America > United States
- New York (0.04)
- Europe > France
- Île-de-France > Paris > Paris (0.04)
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment > Games (0.93)
- Technology: