GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

Sukhija, Bhavya, Turchetta, Matteo, Lindner, David, Krause, Andreas, Trimpe, Sebastian, Baumann, Dominik

Jun-12-2023–arXiv.org Artificial Intelligence

Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be applied to most real-world dynamical systems. This work proposes GoSafeOpt as the first algorithm that can safely discover globally optimal policies for high-dimensional systems while giving safety and optimality guarantees. We demonstrate the superiority of GoSafeOpt over competing model-free safe learning methods on a robot arm that would be prohibitive for GoSafe.

artificial intelligence, gosafeopt, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Jun-12-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe
  - Germany > North Rhine-Westphalia (0.04)
  - Finland (0.04)
  - Switzerland > Zürich
    - Zürich (0.14)
  - Sweden > Uppsala County
    - Uppsala (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Government > Regional Government (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found