Progressive Data Dropout: An Embarrassingly Simple Approach to Train Faster
Neural Information Processing Systems
The success of machine learning has reliably depended on training on large datasets. While effective, this trend comes at an extraordinary cost, driven by two deeply intertwined factors: the size of models and the size of datasets. While promising research efforts focus on reducing the size of models, the other half of the equation, the size of datasets, remains fairly mysterious. Indeed, it is surprising that the standard approach to training is still to iterate over the training dataset again and again, sampling it uniformly.
Jun-23-2026, 01:47:40 GMT