When does label smoothing help?
Rafael Müller, Simon Kornblith, Geoffrey E. Hinton
–Neural Information Processing Systems
To explain these observations, we visualize how label smoothing changes therepresentations learned bythepenultimate layerofthenetwork. We show that label smoothing encourages the representations of training examples from thesame class togroup intight clusters. This results inloss ofinformation inthe logits about resemblances between instances ofdifferent classes, which isnecessary for distillation, but does not hurt generalization or calibration of the model'spredictions.
Neural Information Processing Systems
Feb-15-2026, 01:13:10 GMT
- Country:
- North America > Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- North America > Canada
- Technology: