UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures

Dec-25-2025, 21:42:25 GMT–Neural Information Processing Systems

In reverberant conditions with multiple concurrent speakers, each microphone acquires a mixture signal of multiple speakers at a different location. In over-determined conditions where the microphones out-number speakers, we can narrow down the solutions to speaker images and realize unsupervised speech separation by leveraging each mixture signal as a constraint (i.e., the estimated speaker images at a microphone should add up to the mixture).

artificial intelligence, machine learning, underline, (11 more...)

Neural Information Processing Systems

Dec-25-2025, 21:42:25 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.35)