Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Primus, Paul, Eghbal-zadeh, Hamid, Eitelsebner, David, Koutini, Khaled, Arzt, Andreas, Widmer, Gerhard

Sep-4-2019–arXiv.org Machine Learning

Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of machine listening (Task 1b of the DCASE 2019 Challenge). We propose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-parallel samples, i.e. time-aligned recordings from different recording devices. No classification labels are needed for our domain adaptation (DA) method, which makes the data collection process cheaper.

accuracy, dataset, representation, (11 more...)

arXiv.org Machine Learning

Sep-4-2019

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - Canada > British Columbia (0.04)
  - United States
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California > San Diego County
      - San Diego (0.04)
- Europe
  - Spain (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
  - Austria > Upper Austria
    - Linz (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > New Finding (0.68)

Industry:
- Media > Music (0.40)
- Leisure & Entertainment (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found