ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
–Neural Information Processing Systems
We present ComSL, a speech-language model built atop a composite architecture of public pretrained speech-only and language-only models and optimized data-efficiently for spoken language tasks.
Neural Information Processing Systems
Feb-16-2026, 17:10:54 GMT