On decoder-only architecture for speech-to-text and large language model integration

Open in new window