ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Neural Information Processing Systems
We present ComSL, a speech-language model built atop a composite architecture of public pretrained speech-only and language-only models and optimized data-efficiently for spoken language tasks.
Oct-9-2025, 05:32:30 GMT