DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Balaji, Bharathan, Mallya, Sunil, Genc, Sahika, Gupta, Saurabh, Dirac, Leo, Khare, Vineet, Roy, Gourav, Sun, Tao, Tao, Yunzhe, Townsend, Brian, Calleja, Eddie, Muralidhara, Sunil, Karuppasamy, Dhanasekar
–arXiv.org Artificial Intelligence
-- DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in physical world and demonstrates: 1) formulation and solution of a robust reinforcement learning algorithm, 2) narrowing the reality gap through joint perception and dynamics, 3) distributed on-demand compute architecture for training optimal policies, and 4) a robust evaluation method to identify when to stop training. It is the first successful large-scale deployment of deep reinforcement learning on a robotic control agent that uses only raw camera images as observations and a model-free learning method to perform robust path planning. Due to high sample complexity and safety requirements, it is common to train the RL agent in simulation [1], [5], [17]. To reduce training time and encourage exploration, the agent is usually trained with distributed rollouts [18], [19], [20], [21]. For a successful transfer to the real world, researchers use calibration [2], [22], domain randomization [23], [24], [25], [12], fine tuning with real world data [9], and learn features from a combination of simulation and real data [26], [27]. To experiment with robotic reinforcement learning, one needs to have expertise in many areas, access to a physical robot, an accurate robot model for simulations, a distributed training mechanism and customizability of the training procedure such as modifying the neural network and the loss function or introducing noise. For the uninitiated, dealing with this complexity is daunting and dissuades adoption. As a result, much of prior work is limited to a single robot [1], [23], [28] or a few robots [16]. We reduce the learning curve and alleviate development effort with DeepRacer.
arXiv.org Artificial Intelligence
Nov-4-2019
- Country:
- Asia
- Japan > Honshū
- Kansai > Hyogo Prefecture
- Kobe (0.04)
- Tōhoku > Miyagi Prefecture
- Sendai (0.04)
- Kansai > Hyogo Prefecture
- Middle East
- Jordan (0.05)
- Republic of Türkiye > Karaman Province
- Karaman (0.04)
- Japan > Honshū
- Europe
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Germany > Bavaria
- North America
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- California > Los Angeles County
- Puerto Rico > San Juan
- Asia
- Genre:
- Research Report (0.64)
- Industry:
- Education > Educational Setting (0.46)
- Leisure & Entertainment > Sports (0.68)
- Transportation
- Technology: