EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement

Wen, Bin, Tan, Tien-Ping

arXiv.org Artificial Intelligence 

-- We introduce EffiFusion - GAN (Efficient Fusion Generative Adversarial Network), a novel deep learning model designed to enhance speech processing tasks by leveragin g advanced techniques. Our model incorporates three primary innovations. Firstly, we employ Depthwise Separable Convolutions within a Multi - Scale Convolutional Block to significantly reduce computational complexity while capturin g rich features at multiple scales. This approach enhances the model's ability to process diverse auditory inputs efficiently. S econdly, the model features an enhanced attention mechanism that includes dual Layer Normalization and optimized residual connections, improving stability and convergence during trainin g. Lastly, dynamic pruning is applied to the convolutional layers, whic h reduces the model size without compromising performance, making it ideal for deployment in resource - constrain ed environments.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found