Optimizing Multilingual Text-To-Speech with Accents & Emotions