Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions - Supplementary Materials - Rawal Khirodkar
–Neural Information Processing Systems
For video demos of Harmony4D, please visit: Harmony4D Website. Please do not share the dataset with anyone as it is not publicly available yet. Harmony4D is a 75-minute video dataset collected using over 20 equidistant, synchronized GoPro cameras. It consists of 1.66M images and 3.32M human instances, divided into 1.28M images for [...] We manually clipped the videos into 208 sequences across 6 different activities, ensuring each sequence is at least 5 seconds (100 frames) long for temporal continuity. The 2D bounding boxes are derived from projected SMPL human vertices.
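As an illustration of how a 2D bounding box can be derived from projected mesh vertices, the following minimal sketch projects camera-frame 3D points through an ideal pinhole model and takes the extent of the resulting pixels. The function names, the intrinsics (`fx`, `fy`, `cx`, `cy`), the image size, and the decision to ignore lens distortion are all assumptions made for this example; the excerpt does not describe the dataset's actual camera model or tooling.

```python
from typing import Iterable, List, Optional, Tuple

Point3D = Tuple[float, float, float]
Pixel = Tuple[float, float]
BBox = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def project_points(points_cam: Iterable[Point3D],
                   fx: float, fy: float, cx: float, cy: float) -> List[Pixel]:
    """Project camera-frame 3D points with an ideal pinhole model.

    Assumption: no lens distortion. Points at or behind the camera
    plane (z <= 0) are skipped.
    """
    pixels = []
    for x, y, z in points_cam:
        if z <= 0:
            continue
        pixels.append((fx * x / z + cx, fy * y / z + cy))
    return pixels


def bbox_from_pixels(pixels: List[Pixel],
                     width: int, height: int) -> Optional[BBox]:
    """Return the tight box around the pixels, clipped to the image.

    Returns None if there are no pixels or the clipped box is empty.
    """
    if not pixels:
        return None
    xs = [p[0] for p in pixels]
    ys = [p[1] for p in pixels]
    x_min = max(0.0, min(xs))
    y_min = max(0.0, min(ys))
    x_max = min(width - 1.0, max(xs))
    y_max = min(height - 1.0, max(ys))
    if x_min >= x_max or y_min >= y_max:
        return None
    return (x_min, y_min, x_max, y_max)


# Hypothetical stand-in for mesh vertices already expressed in the camera frame.
vertices = [(-0.5, -1.0, 2.0), (0.5, 1.0, 2.0), (0.0, 0.0, 4.0)]
pixels = project_points(vertices, fx=1000.0, fy=1000.0, cx=960.0, cy=540.0)
box = bbox_from_pixels(pixels, width=1920, height=1080)
```

With real data, the vertex list would hold the posed SMPL mesh vertices transformed into each camera's frame, and a wide-angle lens would likely need a distortion model before the box is computed.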
Feb-17-2026, 23:21:36 GMT