Beyond Confusion: A Fine-grained Dialectical Examination of Human Activity Recognition Benchmark Datasets

Geissler, Daniel, Nshimyimana, Dominique, Rey, Vitor Fortes, Suh, Sungho, Zhou, Bo, Lukowicz, Paul

Dec-12-2024–arXiv.org Artificial Intelligence

The research of machine learning (ML) algorithms for human activity recognition (HAR) has made significant progress with publicly available datasets. However, most research prioritizes statistical metrics over examining negative sample details. While recent models like transformers have been applied to HAR datasets with limited success from the benchmark metrics, their counterparts have effectively solved problems on similar levels with near 100% accuracy. This raises questions about the limitations of current approaches. This paper aims to address these open questions by conducting a fine-grained inspection of six popular HAR benchmark datasets. We identified for some parts of the data, none of the six chosen state-of-the-art ML methods can correctly classify, denoted as the intersect of false classifications (IFC). Analysis of the IFC reveals several underlying problems, including ambiguous annotations, irregularities during recording execution, and misaligned transition periods. We contribute to the field by quantifying and characterizing annotated data ambiguities, providing a trinary categorization mask for dataset patching, and stressing potential improvements for future data collections.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

Dec-12-2024

arXiv.org PDF

Add feedback

Country:
- South America > Argentina
  - Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- North America
  - United States > New York
    - New York County > New York City (0.04)
  - Canada > Newfoundland and Labrador
    - Labrador (0.04)
- Europe
  - Italy > Emilia-Romagna
    - Metropolitan City of Bologna > Bologna (0.04)
  - Germany > Rhineland-Palatinate
    - Kaiserslautern (0.05)
- Asia > Japan
  - Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Information Technology (1.00)
- Health & Medicine
  - Consumer Health (0.46)
  - Therapeutic Area > Cardiology/Vascular Diseases (0.46)
  - Health Care Technology (0.46)

Technology:
- Information Technology
  - Sensing and Signal Processing (1.00)
  - Communications > Networks
    - Sensor Networks (0.67)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (0.93)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Learning Graphical Models > Undirected Networks
        Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found