Probabilistic Inference of Speech Signals from Phaseless Spectrograms

Achan, Kannan, Roweis, Sam T., Frey, Brendan J.

Dec-31-2004–Neural Information Processing Systems

Many techniques for complex speech processing such as denoising and deconvolution, time/frequency warping, multiple speaker separation, and multiple microphone analysis operate on sequences of short-time power spectra (spectrograms), a representation which is often well-suited to these tasks. However, a significant problem with algorithms that manipulate spectrograms is that the output spectrogram does not include a phase component, which is needed to create a time-domain signal that has good perceptual quality. Here we describe a generative model of time-domain speech signals and their spectrograms, and show how an efficient optimizer can be used to find the maximum a posteriori speech signal, given the spectrogram.
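To make the phase problem concrete, the following is a minimal sketch of recovering a time-domain signal from a magnitude-only spectrogram. It is not the generative model or MAP optimizer described in the abstract. It swaps in the classical Griffin-Lim alternating-projection heuristic as a simple stand-in. The helper names (`stft`, `istft`, `griffin_lim`), the Hann window, the frame and hop sizes, and the synthetic test tone are all assumptions chosen for illustration.

```python
import numpy as np


def stft(x, win, hop):
    """Short-time Fourier transform: one windowed rFFT per frame."""
    n = len(win)
    starts = range(0, len(x) - n + 1, hop)
    frames = np.stack([x[i:i + n] * win for i in starts])
    return np.fft.rfft(frames, axis=1)


def istft(spec, win, hop, length):
    """Least-squares inverse STFT via windowed overlap-add."""
    n = len(win)
    frames = np.fft.irfft(spec, n=n, axis=1) * win
    out = np.zeros(length)
    norm = np.zeros(length)
    for k, frame in enumerate(frames):
        out[k * hop:k * hop + n] += frame
        norm[k * hop:k * hop + n] += win ** 2
    # Guard against division by zero where the window vanishes.
    return out / np.maximum(norm, 1e-8)


def griffin_lim(mag, win, hop, length, iters=50, seed=0):
    """Estimate a signal whose STFT magnitude approximates `mag`.

    Illustrative stand-in only: alternates between imposing the known
    magnitudes and re-estimating phase from the resynthesized signal.
    """
    rng = np.random.default_rng(seed)
    phase = np.exp(2j * np.pi * rng.random(mag.shape))  # random start
    errors = []
    for _ in range(iters):
        x = istft(mag * phase, win, hop, length)
        spec = stft(x, win, hop)
        # Relative mismatch between achieved and target magnitudes.
        errors.append(np.linalg.norm(np.abs(spec) - mag) / np.linalg.norm(mag))
        phase = np.exp(1j * np.angle(spec))
    return x, errors


# Hypothetical test signal: a steady tone plus a rising chirp.
sr = 8000
t = np.arange(2048) / sr
signal = np.sin(2 * np.pi * 220 * t) + 0.5 * np.sin(2 * np.pi * (440 + 300 * t) * t)

win = np.hanning(256)
hop = 64
mag = np.abs(stft(signal, win, hop))  # phase is discarded here

estimate, errors = griffin_lim(mag, win, hop, len(signal))
print(f"relative magnitude error: first {errors[0]:.3f}, last {errors[-1]:.3f}")
```

The reported error measures only how well the estimate's spectrogram matches the target magnitudes. The recovered waveform can still differ from the original, for example by a global sign flip, which is one reason a probabilistic model of the time-domain signal is an attractive alternative.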

algorithm, artificial intelligence, spectrogram

Country:
- North America > Canada > Ontario > Toronto (0.15)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.65)
  - Speech (1.00)
