Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James Glass
–Neural Information Processing Systems
Recently, there is an increasing interest in learning the semantics of a language directly, and only from rawspeech [24,27,28].
Neural Information Processing Systems
Feb-12-2026, 09:31:31 GMT
- Country:
- Europe > Italy
- Calabria > Catanzaro Province > Catanzaro (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > Massachusetts
- Middlesex County > Cambridge (0.04)
- Canada > Quebec
- Europe > Italy
- Technology: