Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining

Open in new window