Differentiable Reward Optimization for LLM based TTS system

Open in new window