Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Majd, Keyvan, Clark, Geoffrey, Khandait, Tanmay, Zhou, Siyu, Sankaranarayanan, Sriram, Fainekos, Georgios, Amor, Heni Ben

Mar-12-2023–arXiv.org Artificial Intelligence

Guaranteeing safety in human-centric applications is critical in robot learning as the learned policies may demonstrate unsafe behaviors in formerly unseen scenarios. We present a framework to locally repair an erroneous policy network to satisfy a set of formal safety constraints using Mixed Integer Quadratic Programming (MIQP). Our MIQP formulation explicitly imposes the safety constraints to the learned policy while minimizing the original loss function. The policy network is then verified to be locally safe. We demonstrate the application of our framework to derive safe policies for a robotic lower-leg prosthesis. Figure 1: Left: A trained neural-network policy to control a prosthesis violates formal safety constraints. Right: Our framework repairs the violation while maintaining the underlying behavior.

artificial intelligence, constraint, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Mar-12-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.46)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (1.00)
  - Representation & Reasoning (1.00)
  - Robots (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found