Robust Data Pruning under Label Noise via Maximizing Re-labeling Accuracy
–Neural Information Processing Systems
Data pruning, which aims to downsize a large training set into a small informative subset, is crucial for reducing the enormous computational costs of modern deep learning. Though large-scale data collections invariably contain annotation noise and numerous robust learning methods have been developed, data pruning for the noise-robust learning scenario has received little attention. With state-of-the-art Re-labeling methods that self-correct erroneous labels while training, it is challenging to identify which subset induces the most accurate re-labeling of erroneous labels in the entire training set.
Apr-30-2026, 04:54:09 GMT