CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Neural Information Processing Systems 

The content of the data that a language model is trained on can have profound effects on its performance and the efficiency of the training process [Rae et al., 2021, Longpre et al., 2023, Penedo

Similar Docs  Excel Report  more

TitleSimilaritySource
None found