CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought

Open in new window