EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding
Li, Yuan-Ming, Huang, Wei-Jin, Wang, An-Lan, Zeng, Ling-An, Meng, Jing-Ke, Zheng, Wei-Shi
–arXiv.org Artificial Intelligence
We present EgoExo-Fitness, a new full-body action understanding dataset, featuring fitness sequence videos recorded from synchronized egocentric and fixed exocentric (third-person) cameras. Compared with existing full-body action understanding datasets, EgoExo-Fitness not only contains videos from first-person perspectives, but also provides rich annotations. Specifically, two-level temporal boundaries are provided to localize single action videos along with sub-steps of each action. More importantly, EgoExo-Fitness introduces innovative annotations for interpretable action judgement--including technical keypoint verification, natural language comments on action execution, and action quality scores. Combining all of these, EgoExo-Fitness provides new resources to study egocentric and exocentric full-body action understanding across dimensions of "what", "when", and "how well". To facilitate research on egocentric and exocentric full-body action understanding, we construct benchmarks on a suite of tasks (i.e., action classification, action localization, cross-view sequence verification, cross-view skill determination, and a newly proposed task of guidance-based execution verification), together with detailed analysis. Code and data will be available at https://github.com/iSEE-Laboratory/EgoExo-Fitness/tree/main.
arXiv.org Artificial Intelligence
Jul-16-2024
- Country:
- Asia > China > Guangdong Province (0.14)
- Genre:
- Research Report (0.40)
- Industry:
- Education (0.67)
- Health & Medicine (1.00)
- Leisure & Entertainment > Sports (0.93)
- Technology: