CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training
–Neural Information Processing Systems
The content of the data that a language model is trained on can have profound effects on its performance and the efficiency of the training process [Rae et al., 2021, Longpre et al., 2023, Penedo
Neural Information Processing Systems
Nov-20-2025, 02:13:29 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Slovenia > Drava
- Municipality of Benedikt > Benedikt (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Slovenia > Drava
- North America
- Canada (0.04)
- United States
- Georgia > Gwinnett County (0.04)
- Washington > King County
- Seattle (0.04)
- Asia > Middle East
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Government (0.46)
- Technology: