Learning Spatially-Aware Language and Audio Embeddings

Neural Information Processing Systems 

Humans can picture a sound scene given an imprecise natural language description. For example, it is easy to imagine an acoustic environment given a phrase like "the

Similar Docs  Excel Report  more

TitleSimilaritySource
None found