Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Hansen, Nicklas, S, Jyothir V, Sobal, Vlad, LeCun, Yann, Wang, Xiaolong, Su, Hao
–arXiv.org Artificial Intelligence
Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology. Learning from visual observations further exacerbates this difficulty. In this work, we explore highly data-driven approaches to visual whole-body humanoid control based on reinforcement learning, without any simplifying assumptions, reward design, or skill primitives. Specifically, we propose a hierarchical world model in which a high-level agent generates commands based on visual observations for a low-level agent to execute, both of which are trained with rewards. Our approach produces highly performant control policies in 8 tasks with a simulated 56-DoF humanoid, while synthesizing motions that are broadly preferred by humans.
arXiv.org Artificial Intelligence
May-31-2024
- Country:
- Asia > South Korea
- North America > United States
- California > San Diego County
- San Diego (0.04)
- New York (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- California > San Diego County
- Genre:
- Research Report (1.00)
- Technology: