A Classification-based Cocktail-party Processor

Roman, Nicoleta, Wang, Deliang, Brown, Guy J.

Dec-31-2004–Neural Information Processing Systems

At a cocktail party, a listener can selectively attend to a single voice and filter out other acoustical interferences. How to simulate this perceptual ability remains a great challenge. This paper describes a novel supervised learning approach to speech segregation, in which a target speech signal is separated from interfering sounds using spatial location cues: interaural time differences (ITD) and interaural intensity differences (IID). Motivated by the auditory masking effect, we employ the notion of an ideal time-frequency binary mask, which selects the target if it is stronger than the interference in a local time-frequency unit. Within a narrow frequency band, modifications to the relative strength of the target source with respect to the interference trigger systematic changes for estimated ITD and IID.

inductive learning, interference, speech recognition, (18 more...)

Neural Information Processing Systems

Dec-31-2004

Conferences PDF

Add feedback

Country:
- North America > United States > Ohio (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Inductive Learning (0.34)
  - Speech (0.90)

Duplicate Docs Excel Report

Title
A Classification-based Cocktail-party Processor
A Classification-based Cocktail-party Processor

Similar Docs Excel Report more

Title	Similarity	Source
None found