Contact-conditioned learning of locomotion policies

Jul-16-2024–arXiv.org Artificial Intelligence

Locomotion is realized through making and breaking contact. State-of-the-art constrained nonlinear model predictive controllers (NMPC) generate whole-body trajectories for a given contact sequence. However, these approaches are computationally expensive at run-time. Hence it is desirable to offload some of this computation to an offline phase. In this paper, we hypothesize that conditioning a learned policy on the locations and timings of contact is a suitable representation for learning a single policy that can generate multiple gaits (contact sequences). In this way, we can build a single generalist policy to realize different gaited and non-gaited locomotion skills and the transitions among them. Our extensive simulation results demonstrate the validity of our hypothesis for learning multiple gaits for a biped robot.

contact-conditioned policy, learning, representation, (16 more...)

arXiv.org Artificial Intelligence

Jul-16-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:
- Research Report > New Finding (0.88)

Technology:
- Information Technology > Artificial Intelligence
  - Robots > Locomotion (0.47)
  - Machine Learning > Neural Networks (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found