JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis