DataComp-LM: In search of the next generation of training sets for language models Jeffrey Li* 1, 2 Alex Fang

Neural Information Processing Systems 

As a baseline for DCLM, we conduct extensive experiments and find that model-based filtering is key to assembling a high-quality training set.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found