Identifying Mislabeled Data using the Area Under the Margin Ranking

Neural Information Processing Systems 

Our goal is to automatically identify and subsequently remove mislabeled samples from training datasets. Discarding these harmful samples reduces memorization of incorrect labels and improves generalization.