DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning

Balaji, Bharathan, Mallya, Sunil, Genc, Sahika, Gupta, Saurabh, Dirac, Leo, Khare, Vineet, Roy, Gourav, Sun, Tao, Tao, Yunzhe, Townsend, Brian, Calleja, Eddie, Muralidhara, Sunil, Karuppasamy, Dhanasekar

Nov-4-2019–arXiv.org Artificial Intelligence

-- DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in physical world and demonstrates: 1) formulation and solution of a robust reinforcement learning algorithm, 2) narrowing the reality gap through joint perception and dynamics, 3) distributed on-demand compute architecture for training optimal policies, and 4) a robust evaluation method to identify when to stop training. It is the first successful large-scale deployment of deep reinforcement learning on a robotic control agent that uses only raw camera images as observations and a model-free learning method to perform robust path planning. Due to high sample complexity and safety requirements, it is common to train the RL agent in simulation [1], [5], [17]. To reduce training time and encourage exploration, the agent is usually trained with distributed rollouts [18], [19], [20], [21]. For a successful transfer to the real world, researchers use calibration [2], [22], domain randomization [23], [24], [25], [12], fine tuning with real world data [9], and learn features from a combination of simulation and real data [26], [27]. To experiment with robotic reinforcement learning, one needs to have expertise in many areas, access to a physical robot, an accurate robot model for simulations, a distributed training mechanism and customizability of the training procedure such as modifying the neural network and the loss function or introducing noise. For the uninitiated, dealing with this complexity is daunting and dissuades adoption. As a result, much of prior work is limited to a single robot [1], [23], [28] or a few robots [16]. We reduce the learning curve and alleviate development effort with DeepRacer.

international conference, reinforcement learning, simulation, (12 more...)

arXiv.org Artificial Intelligence

Nov-4-2019

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - California > Los Angeles County
      - Long Beach (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
- Europe
  - United Kingdom > England
    - Greater London > London (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Hungary > Budapest
    - Budapest (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)
- Asia
  - Middle East
    - Jordan (0.05)
    - Republic of Türkiye > Karaman Province
      - Karaman (0.04)
  - Japan > Honshū
    - Tōhoku > Miyagi Prefecture
      - Sendai (0.04)
    - Kansai > Hyogo Prefecture
      - Kobe (0.04)

Genre:
- Research Report (0.64)

Industry:
- Leisure & Entertainment > Sports (0.68)
- Education > Educational Setting (0.46)
- Transportation
  - Ground > Road (0.48)
  - Passenger (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found