Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification
Primus, Paul, Eghbal-zadeh, Hamid, Eitelsebner, David, Koutini, Khaled, Arzt, Andreas, Widmer, Gerhard
Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of machine listening (Task 1b of the DCASE 2019 Challenge). We propose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-parallel samples, i.e. time-aligned recordings from different recording devices. No classification labels are needed for our domain adaptation (DA) method, which makes the data collection process cheaper.
Sep-4-2019
- Country:
- South America > Chile
- North America
- Canada > British Columbia (0.04)
- United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > San Diego County
- San Diego (0.04)
- Louisiana > Orleans Parish
- Europe
- Spain (0.04)
- France > Hauts-de-France
- Austria > Upper Austria
- Linz (0.04)
- Asia > Middle East
- Jordan (0.04)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Media > Music (0.40)
- Leisure & Entertainment (0.40)
- Technology: