Sparse Curriculum Reinforcement Learning for End-to-End Driving
Agarwal, Pranav, de Beaucorps, Pierre, de Charette, Raoul
–arXiv.org Artificial Intelligence
Abstract-- Deep reinforcement Learning for end-to-end driving is limited by the need of complex reward engineering. Sparse rewards can circumvent this challenge but suffers from long training time and leads to sub-optimal policy. In this work, we explore driving using only goal conditioned sparse rewards and propose a curriculum learning approach for end to end driving using only navigation view maps that benefit from small virtual-to-real domain gap. To address the complexity of multiple driving policies, we learn concurrent individual policies which are selected at inference by a navigation system. Figure 1: End to End Driving. Performance of Deep RL in competing with humans in games like Atari [32], AlphaGo [46] and Dota [2] has shown its potential to solve complex decision making problems for navigation decision at intersections (e.g.
arXiv.org Artificial Intelligence
Mar-16-2021
- Genre:
- Research Report (0.64)
- Industry:
- Information Technology (0.49)
- Automobiles & Trucks (0.48)
- Transportation > Ground
- Road (0.48)
- Leisure & Entertainment > Games
- Go (0.34)
- Technology: