LMAC-TD: Producing Time Domain Explanations for Audio Classifiers
Mancini, Eleonora, Paissan, Francesco, Ravanelli, Mirco, Subakan, Cem
–arXiv.org Artificial Intelligence
Neural networks are typically black-boxes that remain opaque with regards to their decision mechanisms. Several works in the literature have proposed post-hoc explanation methods to alleviate this issue. This paper proposes LMAC-TD, a post-hoc explanation method that trains a decoder to produce explanations directly in the time domain. This methodology builds upon the foundation of L-MAC, Listenable Maps for Audio Classifiers, a method that produces faithful and listenable explanations. We incorporate SepFormer, a popular transformer-based time-domain source separation architecture. We show through a user study that LMAC-TD significantly improves the audio quality of the produced explanations while not sacrificing from faithfulness.
arXiv.org Artificial Intelligence
Sep-13-2024
- Country:
- Asia
- Europe > Italy
- Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
- North America
- Canada > Quebec (0.04)
- United States > New York
- New York County > New York City (0.04)
- Genre:
- Research Report (1.00)
- Technology: