Monaural Speech Separation

Dec-31-2003–Neural Information Processing Systems

Monaural speech separation has been studied in previous systems that incorporate auditory scene analysis principles. A major problem for these systems is their inability to deal with speech in the highfrequency range. Psychoacoustic evidence suggests that different perceptual mechanisms are involved in handling resolved and unresolved harmonics. Motivated by this, we propose a model for monaural separation that deals with low-frequency and highfrequency signals differently. For resolved harmonics, our model generates segments based on temporal continuity and cross-channel correlation, and groups them according to periodicity. For unresolved harmonics, the model generates segments based on amplitude modulation (AM) in addition to temporal continuity and groups them according to AM repetition rates derived from sinusoidal modeling. Underlying the separation process is a pitch contour obtained according to psychoacoustic constraints. Our model is systematically evaluated, and it yields substantially better performance than previous systems, especially in the high-frequency range.

pitch period, speech, target speech, (15 more...)

Neural Information Processing Systems

Dec-31-2003

Conferences PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Ohio > Franklin County
    - Columbus (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - California > San Diego County
    - San Diego (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Speech (0.47)
  - Machine Learning (0.46)

Duplicate Docs Excel Report

Title
Monaural Speech Separation
Monaural Speech Separation

Similar Docs Excel Report more

Title	Similarity	Source
None found