From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages