Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding
–Neural Information Processing Systems
Recent advancements in foundation models for 2D vision have substantially improved the analysis of dynamic scenes from monocular videos. However, despite their strong generalization capabilities, these models often lack 3D consistency, a fundamental requirement for understanding scene geometry and motion, thereby causing severe spatial misalignment and temporal flickering in complex 3D environments.
Neural Information Processing Systems
Jun-13-2026, 17:57:35 GMT
- Technology:
- Information Technology > Artificial Intelligence > Vision (1.00)