Learning Spatially-Aware Language and Audio Embeddings
–Neural Information Processing Systems
Humans can picture a sound scene given an imprecise natural language description. For example, it is easy to imagine an acoustic environment given a phrase like "the
Neural Information Processing Systems
Feb-11-2026, 10:46:19 GMT
- Country:
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Leisure & Entertainment (1.00)
- Media (0.67)
- Technology: