Latent Conditioned Loco-Manipulation Using Motion Priors
Stępień, Maciej, Kourdis, Rafael, Roux, Constant, Stasse, Olivier
–arXiv.org Artificial Intelligence
Figure 1: Our Solo12 loco-manipulation policy is able to execute and smoothly transition between locomotion and manipulation to reach a specified target. Abstract-- Although humanoid and quadruped robots provide a wide range of capabilities, current control methods, such as Deep Reinforcement Learning, focus mainly on single skills. This approach is inefficient for solving more complicated tasks where high-level goals, physical robot limitations and desired motion style might all need to be taken into account. A more effective approach is to first train a multipurpose motion policy that acquires low-level skills through imitation, while providing latent space control over skill execution. Then, this policy can be used to efficiently solve downstream tasks. This method has already been successful for controlling characters in computer graphics. In this work, we apply the approach to humanoid and quadrupedal loco-manipulation by imitating either simple synthetic motions or kinematically retargeted dog motions. We extend the original formulation to handle constraints, ensuring deployment safety, and use a diffusion discriminator for better imitation quality.
arXiv.org Artificial Intelligence
Sep-22-2025