Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Lemercier, Jean-Marie, Tobergte, Julian, Gerkmann, Timo
–arXiv.org Artificial Intelligence
In this paper, we present a scheme for extending deep neural network-based multiplicative maskers to deep subband filters for speech restoration in the time-frequency domain. The resulting method can be generically applied to any deep neural network providing masks in the time-frequency domain, while requiring only few more trainable parameters and a computational overhead that is negligible for state-of-the-art neural networks. We demonstrate that the resulting deep subband filtering scheme outperforms multiplicative masking for dereverberation, while leaving the denoising performance virtually the same. We argue that this is because deep subband filtering in the time-frequency domain fits the subband approximation often assumed in the dereverberation literature, whereas multiplicative masking corresponds to the narrowband approximation generally employed for denoising.
arXiv.org Artificial Intelligence
May-31-2023
- Country:
- North America
- United States
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Utah > Salt Lake County
- Canada
- Ontario > Toronto (0.04)
- Alberta > Census Division No. 6
- Calgary Metropolitan Region > Calgary (0.04)
- United States
- Europe
- Germany > Hamburg (0.04)
- United Kingdom > England
- East Sussex > Brighton (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Tuscany
- Florence (0.04)
- Czechia > South Moravian Region
- Brno (0.04)
- Asia
- South Korea > Incheon
- Incheon (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- South Korea > Incheon
- North America
- Genre:
- Research Report (0.50)
- Technology: