MultimodalandMultilingualEmbeddings forLarge-ScaleSpeechMining

Neural Information Processing Systems 

Using a similarity metric in that multimodal embedding space, we perform mining of audio in German, French, Spanish and English from Librivox against billions of sentences from CommonCrawl.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found