Data Selection for Language Models via Importance Resampling

Neural Information Processing Systems 

Selecting a suitable pretraining dataset is crucial for both general-domain (e.g., GPT -

Similar Docs  Excel Report  more

TitleSimilaritySource
None found