Balanced Meta-Softmax for Long-Tailed Visual Recognition
Neural Information Processing Systems
Deep classifiers have achieved great success in visual recognition. However, real-world data is long-tailed by nature, which creates a mismatch between the training and testing distributions. In this paper, we show that the Softmax function, though used in most classification tasks, gives a biased gradient estimation under the long-tailed setup. We present Balanced Softmax, an elegant unbiased extension of Softmax, to accommodate the label distribution shift between training and testing. Theoretically, we derive the generalization bound for multiclass Softmax regression and show that our loss minimizes the bound.
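The abstract does not spell out the loss itself. As a minimal sketch, assuming the commonly cited form of Balanced Softmax in which each logit is shifted by the log of its class's training-sample count before the usual cross-entropy is applied, the idea might look as follows. The function name `balanced_softmax_loss` and its arguments are hypothetical and chosen for illustration only.

```python
import math


def balanced_softmax_loss(logits, labels, class_counts):
    """Mean cross-entropy after adding log(class count) to each logit.

    Assumption: this follows the commonly cited Balanced Softmax form;
    the abstract above does not state the formula.
    logits: list of per-sample score lists, labels: list of class indices,
    class_counts: number of training samples per class.
    """
    log_prior = [math.log(n) for n in class_counts]
    total = 0.0
    for row, y in zip(logits, labels):
        # Shift each logit by the log of its class frequency.
        shifted = [z + lp for z, lp in zip(row, log_prior)]
        # Numerically stable log-sum-exp for the normalizer.
        m = max(shifted)
        log_norm = m + math.log(sum(math.exp(s - m) for s in shifted))
        # Negative log-probability of the true class.
        total += log_norm - shifted[y]
    return total / len(labels)


# With equal logits, the predicted training-time probability of each class
# matches its share of the training data (here 90% head, 10% tail).
head_loss = balanced_softmax_loss([[0.0, 0.0]], [0], [90, 10])
tail_loss = balanced_softmax_loss([[0.0, 0.0]], [1], [90, 10])
```

Under this assumption, the shift is used only during training; at test time the unshifted logits are passed through a standard Softmax. When all class counts are equal, the shift cancels in the normalizer and the loss reduces to ordinary Softmax cross-entropy.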
Oct-9-2024, 20:33:36 GMT