An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments
Mandel, Michael I., Ellis, Daniel P., Jebara, Tony
–Neural Information Processing Systems
We present a method for localizing and separating sound sources in stereo recordings thatis robust to reverberation and does not make any assumptions about the source statistics. The method consists of a probabilistic model of binaural multisource recordingsand an expectation maximization algorithm for finding the maximum likelihood parameters of that model. These parameters include distributions over delays and assignments of time-frequency regions to sources. We evaluate this method against two comparable algorithms on simulations of simultaneous speech from two or three sources. Our method outperforms the others in anechoic conditionsand performs as well as the better of the two in the presence of reverberation.
Neural Information Processing Systems
Dec-31-2007