Contact-conditioned learning of locomotion policies
Ciebielski, Michal, Khadiv, Majid
–arXiv.org Artificial Intelligence
Locomotion is realized through making and breaking contact. State-of-the-art constrained nonlinear model predictive controllers (NMPC) generate whole-body trajectories for a given contact sequence. However, these approaches are computationally expensive at run-time. Hence it is desirable to offload some of this computation to an offline phase. In this paper, we hypothesize that conditioning a learned policy on the locations and timings of contact is a suitable representation for learning a single policy that can generate multiple gaits (contact sequences). In this way, we can build a single generalist policy to realize different gaited and non-gaited locomotion skills and the transitions among them. Our extensive simulation results demonstrate the validity of our hypothesis for learning multiple gaits for a biped robot.
arXiv.org Artificial Intelligence
Jul-16-2024
- Country:
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Genre:
- Research Report > New Finding (0.88)
- Technology:
- Information Technology > Artificial Intelligence
- Robots > Locomotion (0.47)
- Machine Learning > Neural Networks (0.47)
- Information Technology > Artificial Intelligence