ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation
–Neural Information Processing Systems
We propose ManiPose, a manifold-constrained multi-hypothesis model for human-pose 2D-to-3D lifting. We provide theoretical and empirical evidence that, due to the depth ambiguity inherent to monocular 3D human pose estimation, traditional regression models suffer from pose-topology consistency issues, which standard evaluation metrics (MPJPE, P-MPJPE and PCK) fail to assess. ManiPose addresses depth ambiguity by proposing multiple candidate 3D poses for each 2D input, each with its estimated plausibility. By constraining the outputs to lie on the human pose manifold, ManiPose guarantees the consistency of all hypothetical poses, in contrast to previous works. We showcase the performance of ManiPose on real-world datasets, where it outperforms state-of-the-art models in pose consistency by a large margin while being very competitive on the MPJPE metric.
Neural Information Processing Systems
May-27-2025, 15:26:50 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.45)
- Robots > Humanoid Robots (0.66)
- Vision > Video Understanding (0.66)
- Information Technology > Artificial Intelligence