Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions - Supplementary Materials - Rawal Khirodkar

Neural Information Processing Systems

For video demos of Harmony4D, please visit the Harmony4D website. Please do not share the dataset with anyone, as it is not publicly available yet. Harmony4D is a 75-minute video dataset collected using over 20 equidistant, synchronized GoPro cameras. It consists of 1.66M images and 3.32M human instances, divided into 1.28M images for ... We manually clipped the videos into 208 sequences across 6 different activities, ensuring each sequence is at least 5 seconds (100 frames) long for temporal continuity. The 2D bounding boxes are derived from projected SMPL human vertices.
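As a rough illustration of how a 2D bounding box can be derived from projected mesh vertices, the sketch below projects 3D vertices through a simple pinhole camera and takes the extremes of the resulting pixel coordinates. This is not the authors' pipeline. The function names, the assumption that vertices are already in camera coordinates, the absence of lens distortion, and the intrinsics in the usage lines are all hypothetical, chosen only for illustration.

```python
from typing import Optional, Sequence, Tuple

Point3D = Tuple[float, float, float]
BBox = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels


def project_point(p: Point3D, fx: float, fy: float, cx: float, cy: float) -> Tuple[float, float]:
    """Project one camera-space point with an ideal pinhole model (no distortion)."""
    x, y, z = p
    return (fx * x / z + cx, fy * y / z + cy)


def bbox_from_vertices(
    vertices_cam: Sequence[Point3D],
    fx: float, fy: float, cx: float, cy: float,
    width: int, height: int,
) -> Optional[BBox]:
    """Return the tight box around the projected vertices, clipped to the image.

    Assumes vertices are already expressed in the camera frame (hypothetical
    setup). Vertices behind the camera (z <= 0) are ignored. Returns None if
    nothing visible remains.
    """
    pts = [project_point(v, fx, fy, cx, cy) for v in vertices_cam if v[2] > 0]
    if not pts:
        return None
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    # Clip the box to the image bounds.
    x_min, x_max = max(0.0, min(xs)), min(float(width - 1), max(xs))
    y_min, y_max = max(0.0, min(ys)), min(float(height - 1), max(ys))
    if x_min >= x_max or y_min >= y_max:
        return None  # box lies entirely outside the image
    return (x_min, y_min, x_max, y_max)


# Usage with made-up intrinsics and three made-up vertices two metres away.
demo_vertices = [(-0.5, -1.0, 2.0), (0.5, 1.0, 2.0), (0.0, 0.0, 2.0)]
demo_box = bbox_from_vertices(demo_vertices, 1000.0, 1000.0, 960.0, 540.0, 1920, 1080)
print(demo_box)
```

With a real SMPL mesh, the same min/max step would run over all mesh vertices for each person and camera view. A real multi-camera rig would also need the extrinsic transform and a lens distortion model, both of which this sketch leaves out.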


Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions

Neural Information Processing Systems

Understanding how humans interact with each other is key to building realistic multi-human virtual reality systems. This area remains relatively unexplored due to the lack of large-scale datasets. Recent datasets focusing on this issue mainly consist of activities captured entirely in controlled indoor environments with choreographed actions, which significantly limits their diversity.