Dual-Diffusion for Binocular 3D Human Pose Estimation
–Neural Information Processing Systems
Binocular 3D human pose estimation (HPE), reconstructing a 3D pose from 2D poses of two views, offers practical advantages by combining multiview geometry with the convenience of a monocular setup. However, compared to a multiview setup, the reduction in the number of cameras increases uncertainty in 3D reconstruction. To address this issue, we leverage the diffusion model, which has shown success in monocular 3D HPE by recovering 3D poses from noisy data with high uncertainty. Yet, the uncertainty distribution of initial 3D poses remains unknown. Considering that 3D errors stem from 2D errors within geometric constraints, we recognize that the uncertainties of 3D and 2D are integrated in a binocular configuration, with the initial 2D uncertainty being well-defined. Based on this insight, we propose Dual-Diffusion specifically for Binocular 3D HPE, simultaneously denoising the uncertainties in 2D and 3D, and recovering plausible and accurate results.
Neural Information Processing Systems
May-27-2025, 08:33:01 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.42)
- Robots > Humanoid Robots (0.64)
- Vision > Video Understanding (0.64)
- Information Technology > Artificial Intelligence