LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Attention
Zhang, Yaokai, Pei, Hanchen, Wang, Wanqi, Huang, Gongping
–arXiv.org Artificial Intelligence
Deep learning based end-to-end multi-channel speech enhancement methods have achieved impressive performance by leveraging sub-band, cross-band, and spatial information. However, these methods often demand substantial computational resources, limiting their practicality on terminal devices. This paper presents a lightweight multi-channel speech enhancement network with decoupled fully connected attention (LMFCA-Net). The proposed LMFCA-Net introduces time-axis decoupled fully-connected attention (T-FCA) and frequency-axis decoupled fully-connected attention (F-FCA) mechanisms to effectively capture long-range narrow-band and cross-band information without recurrent units. Experimental results show that LMFCA-Net performs comparably to state-of-the-art methods while significantly reducing computational complexity and latency, making it a promising solution for practical applications.
arXiv.org Artificial Intelligence
Feb-17-2025
- Country:
- Asia (0.29)
- Genre:
- Research Report
- New Finding (0.66)
- Promising Solution (0.54)
- Research Report
- Industry:
- Health & Medicine (0.31)
- Technology: