Multimodal and Multilingual Embeddings for Large-Scale Speech Mining

Neural Information Processing Systems 

Our approach can also be used to directly perform speech-to-speech mining, without the need to first transcribe or translate the data.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found