Weakly-Supervised Temporal Localization via Occurrence Count Learning
Schroeter, Julien, Sidorov, Kirill, Marshall, David
We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise and time-consuming process of annotating event locations in temporal data. Unlike existing methods, in which localization is explicitly achieved by design, our model learns localization implicitly as a byproduct of learning to count instances. This unique feature is a direct consequence of the model's theoretical properties. We validate the effectiveness of our approach in a number of experiments (drum hit and piano onset detection in audio, digit detection in images) and demonstrate performance comparable to that of fully-supervised state-of-the-art methods, despite much weaker training requirements.
May-17-2019
- Country:
- Europe > United Kingdom (0.04)
- North America > United States
- California > Los Angeles County > Long Beach (0.04)
- Genre:
- Research Report
- Experimental Study (0.46)
- Promising Solution (0.54)
- Research Report
- Industry:
- Education (0.34)
- Technology: