GRAPE: Optimize Data Mixture for Group Robust Multi-target Adaptive Pretraining
–Neural Information Processing Systems
The performance of large language models (LLMs) across diverse downstream applications is fundamentally governed by the quality and composition of their pretraining corpora.
Neural Information Processing Systems
Jun-23-2026, 00:52:15 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Leisure & Entertainment (1.00)
- Education > Curriculum (0.68)
- Law (0.67)
- Health & Medicine > Therapeutic Area (0.46)
- Technology: