Are all Frames Equal? Active Sparse Labeling for Video Action Detection

Oct-11-2024, 05:54:03 GMT–Neural Information Processing Systems

Video action detection requires annotations at every frame, which drastically increases the labeling cost. In this work, we focus on efficient labeling of videos for action detection to minimize this cost. We propose active sparse labeling (ASL), a novel active learning strategy for video action detection. Sparse labeling will reduce the annotation cost but poses two main challenges; 1) how to estimate the utility of annotating a single frame for action detection as detection is performed at video level?, and 2) how these sparse labels can be used for action detection which require annotations on all the frames? This work attempts to address these challenges within a simple active learning framework.

action detection, active sparse, video action detection, (3 more...)

Neural Information Processing Systems

Oct-11-2024, 05:54:03 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.67)