Listening to Sounds of Silence for Speech Denoising

Oct-10-2024, 11:43:05 GMT–Neural Information Processing Systems

We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental silent intervals to learn a model for automatic speech denoising given only mono-channel audio. Detected silent intervals over time expose not just pure noise but its time-varying features, allowing the model to learn noise dynamics and suppress it from the speech signal.

listening, silent interval, speech denoising, (1 more...)

Neural Information Processing Systems

Oct-10-2024, 11:43:05 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Speech (0.69)
  - Machine Learning > Neural Networks
    - Deep Learning (0.64)