Cross-Lingual Supervision improves Large Language Models Pre-training

Open in new window