ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning
Sachdeva, Ragav, Cordeiro, Filipe R, Belagiannis, Vasileios, Reid, Ian, Carneiro, Gustavo
–arXiv.org Artificial Intelligence
We propose a new training algorithm, ScanMix, that explores semantic clustering and semi-supervised learning (SSL) to allow superior robustness to severe label noise and competitive robustness to non-severe label noise problems, in comparison to the state of the art (SOTA) methods. ScanMix is based on the expectation maximisation framework, where the E-step estimates the latent variable to cluster the training images based on their appearance and classification results, and the M-step optimises the SSL classification and learns effective feature representations via semantic clustering. We present a theoretical result that shows the correctness and convergence of ScanMix, and an empirical result that shows that ScanMix has SOTA results on CIFAR-10/-100 (with symmetric, asymmetric and semantic label noise), Red Mini-ImageNet (from the Controlled Noisy Web Labels), Clothing1M and WebVision. In all benchmarks with severe label noise, our results are competitive to the current SOTA.
arXiv.org Artificial Intelligence
Oct-16-2022
- Country:
- Oceania > Australia (0.04)
- South America > Brazil
- Pernambuco (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Surrey (0.04)
- Germany > Saxony-Anhalt
- Magdeburg (0.04)
- United Kingdom > England
- Genre:
- Research Report > New Finding (0.48)
- Technology: