From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

Open in new window