Beyond neural scaling laws: beating power law scaling via data pruning

Open in new window