An AI has learned how to pick a single voice out of a crowd

New Scientist 

Devices like Amazon's Echo and Google Home can usually deal with requests from a lone person, but like us they struggle in situations such as a noisy cocktail party, where several people are speaking at once. Now an AI that is able to separate the voices of multiple speakers in real time promises to give automatic speech recognition a big boost, and could soon find its way into an elevator near you. The technology, developed by researchers at the Mitsubishi Electric Research Laboratory in Cambridge, Massachusetts, was demonstrated in public for the first time at this month's Combined Exhibition of Advanced Technologies show in Tokyo. It uses a machine learning technique the team calls "deep clustering" to identifies unique features in the "voiceprint" of multiple speakers. It then groups the distinct features from each speaker's voice together, allowing it to disentangle multiple voices and then reconstruct what each person was saying.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found