Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF

Jan-19-2025, 04:01:51 GMT–Neural Information Processing Systems

This paper tackles post-hoc interpretability for audio processing networks. Our goal is to interpret decisions of a trained network in terms of high-level audio objects that are also listenable for the end-user. To this end, we propose a novel interpreter design that incorporates non-negative matrix factorization (NMF). In particular, a regularized interpreter module is trained to take hidden layer representations of the targeted network as input and produce time activations of pre-learnt NMF components as intermediate outputs. Our methodology allows us to generate intuitive audio-based interpretations that explicitly enhance parts of the input signal most relevant for a network's decision.

audio network, nmf, post-hoc interpretability, (1 more...)

Neural Information Processing Systems

Jan-19-2025, 04:01:51 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)