Unified Microphone Conversion: Many-to-Many Device Mapping via Feature-wise Linear Modulation
Ryu, Myeonghoon, Oh, Hongseok, Lee, Suji, Park, Han
arXiv.org Artificial Intelligence
In this study, we introduce Unified Microphone Conversion, a unified generative framework that makes sound event classification systems more resilient to device variability. To address the limitations of previous work, we condition the generator network on frequency response information to achieve many-to-many device mapping. This overcomes an inherent limitation of CycleGAN, which requires a separate model for each device pair. Our framework retains CycleGAN's ability to learn from unpaired data when simulating device characteristics in audio recordings, and it extends CycleGAN's scalability by integrating frequency-response-related information via Feature-wise Linear Modulation. The experimental results show that our method outperforms the state-of-the-art method by 2.6% and reduces variability by 0.8% in macro-average F1 score.
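As a rough illustration of the conditioning mechanism named in the abstract, the sketch below applies Feature-wise Linear Modulation to a batch of feature maps: a conditioning vector is projected to a per-channel scale (gamma) and shift (beta), which then modulate the features. It is a minimal NumPy sketch and not the authors' implementation. The function name `film`, the single linear projection, the tensor shapes, and the use of a sampled target-device frequency response as the conditioning vector are all assumptions made for illustration.

```python
import numpy as np

def film(features, cond, w_gamma, b_gamma, w_beta, b_beta):
    """Scale and shift each channel of `features` based on `cond`.

    features: (batch, channels, freq, time) feature maps
    cond:     (batch, cond_dim) conditioning vector; assumed here to be a
              sampled frequency response of the target device
    w_*, b_*: hypothetical linear-projection parameters
    """
    gamma = cond @ w_gamma + b_gamma  # per-channel scale, (batch, channels)
    beta = cond @ w_beta + b_beta     # per-channel shift, (batch, channels)
    # Broadcast the per-channel parameters over the freq and time axes.
    return gamma[:, :, None, None] * features + beta[:, :, None, None]

# Toy shapes, chosen only for the example.
rng = np.random.default_rng(0)
batch, channels, n_freq, n_time, cond_dim = 2, 4, 8, 16, 8
features = rng.standard_normal((batch, channels, n_freq, n_time))
cond = rng.standard_normal((batch, cond_dim))

w_gamma = 0.1 * rng.standard_normal((cond_dim, channels))
b_gamma = np.ones(channels)   # start near the identity scale
w_beta = 0.1 * rng.standard_normal((cond_dim, channels))
b_beta = np.zeros(channels)   # start near a zero shift

out = film(features, cond, w_gamma, b_gamma, w_beta, b_beta)
```

Because the target device enters only through `cond`, one set of generator weights can in principle serve every device pair, which is the scalability argument the abstract makes against training a separate CycleGAN per pair.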
Oct-23-2024
- Country:
- Asia > South Korea
- North America > United States
- California > San Diego County > San Diego (0.04)
- Genre:
- Research Report > New Finding (0.69)
- Industry:
- Leisure & Entertainment (0.36)
- Media (0.50)
- Technology: