ISAC: An Invertible and Stable Auditory Filter Bank with Customizable Kernels for ML Integration

Haider, Daniel, Perfler, Felix, Balazs, Peter, Hollomey, Clara, Holighaus, Nicki

arXiv.org Artificial Intelligence 

This paper introduces ISAC, an invertible and stable, perceptually-motivated filter bank that is specifically designed to be integrated into machine learning paradigms. More precisely, the center frequencies and bandwidths of the filters are chosen to follow a non-linear, auditory frequency scale, the filter kernels have user-defined maximum temporal support and may serve as learnable convolutional kernels, and there exists a corresponding filter bank such that both form a perfect reconstruction pair. ISAC provides a powerful and user-friendly audio front-end suitable for any application, including analysis-synthesis schemes.