JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis

Open in new window