GRAPE: Optimize Data Mixture for Group Robust Multi-target Adaptive Pretraining

Jun-23-2026, 00:52:15 GMT–Neural Information Processing Systems

The performance of large language models (LLMs) across diverse downstream applications is fundamentally governed by the quality and composition of their pretraining corpora.

domain weight, machine learning, natural language

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Leisure & Entertainment (1.00)
- Education > Curriculum (0.68)
- Law (0.67)
- Health & Medicine > Therapeutic Area (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Statistical Learning (0.93)
  - Representation & Reasoning > Optimization (0.68)
