Beyond Confusion: A Fine-grained Dialectical Examination of Human Activity Recognition Benchmark Datasets
Geissler, Daniel, Nshimyimana, Dominique, Rey, Vitor Fortes, Suh, Sungho, Zhou, Bo, Lukowicz, Paul
–arXiv.org Artificial Intelligence
The research of machine learning (ML) algorithms for human activity recognition (HAR) has made significant progress with publicly available datasets. However, most research prioritizes statistical metrics over examining negative sample details. While recent models like transformers have been applied to HAR datasets with limited success from the benchmark metrics, their counterparts have effectively solved problems on similar levels with near 100% accuracy. This raises questions about the limitations of current approaches. This paper aims to address these open questions by conducting a fine-grained inspection of six popular HAR benchmark datasets. We identified for some parts of the data, none of the six chosen state-of-the-art ML methods can correctly classify, denoted as the intersect of false classifications (IFC). Analysis of the IFC reveals several underlying problems, including ambiguous annotations, irregularities during recording execution, and misaligned transition periods. We contribute to the field by quantifying and characterizing annotated data ambiguities, providing a trinary categorization mask for dataset patching, and stressing potential improvements for future data collections.
arXiv.org Artificial Intelligence
Dec-12-2024
- Country:
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Europe
- North America
- Canada > Newfoundland and Labrador
- Labrador (0.04)
- United States > New York
- New York County > New York City (0.04)
- Canada > Newfoundland and Labrador
- South America > Argentina
- Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- Asia > Japan
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Health & Medicine
- Consumer Health (0.46)
- Health Care Technology (0.46)
- Therapeutic Area > Cardiology/Vascular Diseases (0.46)
- Information Technology (1.00)
- Health & Medicine
- Technology: