UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
–Neural Information Processing Systems
In reverberant conditions with multiple concurrent speakers, each microphone acquires a mixture signal of multiple speakers at a different location. In over-determined conditions where the microphones out-number speakers, we can narrow down the solutions to speaker images and realize unsupervised speech separation by leveraging each mixture signal as a constraint (i.e., the estimated speaker images at a microphone should add up to the mixture).
Neural Information Processing Systems
Dec-25-2025, 21:42:25 GMT
- Technology: