PHUMA: Physically-Grounded Humanoid Locomotion Dataset
Lee, Kyungmin, Kim, Sibeen, Park, Minho, Kim, Hyunseung, Hwang, Dongyoon, Lee, Hojoon, Choo, Jaegul
–arXiv.org Artificial Intelligence
Each column illustrates four failure modes: joint violation, floating, penetration, and skating. Humanoid-X (Mao et al., 2025) (top row) often exhibits these issues due to direct video-to-motion conversion, while PHUMA (bottom row) mitigates those violations through careful data curation and physically grounded retargeting. Motion imitation is a promising approach for humanoid locomotion, enabling agents to acquire humanlike behaviors. Existing methods typically rely on high-quality motion capture datasets such as AMASS, but these are scarce and expensive, limiting scalability and diversity. Recent studies attempt to scale data collection by converting large-scale internet videos, exemplified by Humanoid-X. However, they often introduce physical artifacts such as floating, penetration, and foot skating, which hinder stable imitation. In response, we introduce PHUMA, a Physically-grounded HUMAnoid locomotion dataset that leverages human video at scale, while addressing physical artifacts through careful data curation and physics-constrained retargeting. PHUMA enforces joint limits, ensures ground contact, and eliminates foot skating, producing motions that are both large-scale and physically reliable. We evaluated PHUMA in two sets of conditions: (i) imitation of unseen motion from self-recorded test videos and (ii) path following with pelvis-only guidance. In both cases, PHUMA-trained policies outperform Humanoid-X and AMASS, achieving significant gains in imitating diverse motions. The code is available at https://davian-robotics.github.io/PHUMA. Humanoid robots are central to the pursuit of general-purpose embodied AI, but their deployment in real-world first requires locomotion that is both stable and humanlike.
arXiv.org Artificial Intelligence
Oct-31-2025
- Country:
- Asia > Japan
- Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- North America > United States
- Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Asia > Japan
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Robots > Humanoid Robots (0.35)
- Vision > Video Understanding (0.35)
- Information Technology > Artificial Intelligence