AQuA: A Benchmarking Tool for Label Quality Assessment
–Neural Information Processing Systems
Machine learning (ML) models are only as good as the data they are trained on. But recent studies have found that datasets widely used to train and evaluate ML models, e.g., ImageNet, have pervasive labeling errors. Erroneous labels in the training set hurt ML models' ability to generalize, and erroneous labels in the test set distort evaluation and model selection. Consequently, learning in the presence of labeling errors is an active area of research, yet the field lacks a comprehensive benchmark for evaluating these methods.
Jan-20-2025, 03:18:05 GMT