Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models
Bıyık, Erdem, Margoliash, Jonathan, Alimo, Shahrouz Ryan, Sadigh, Dorsa
–arXiv.org Artificial Intelligence
Process (MDP) using Gaussian processes. In their work, they assumed the transition model is known and that there exists I. INTRODUCTION a predefined safety function. Both of these assumptions can Guaranteeing safety is a vital issue for many modern be quite restrictive when the system is going to operate in robotics systems, such as unmanned aerial vehicles (UAVs), unknown environments. In our work, we plan to address autonomous cars, or domestic robots [1], [2], [3]. One both of these challenges by considering unknown transition approach is to attempt to specify all potential scenarios models, and no access to a predefined safety function.
arXiv.org Artificial Intelligence
Apr-1-2019
- Country:
- North America > United States > California
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- North America > United States > California
- Genre:
- Research Report (0.64)
- Industry:
- Transportation (0.54)
- Information Technology > Robotics & Automation (0.54)
- Government (0.46)
- Automobiles & Trucks (0.34)
- Aerospace & Defense > Aircraft (0.34)