SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training

Open in new window