Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces

May-27-2025, 12:26:39 GMT–Neural Information Processing Systems

In extreme classification settings, embedding-based neural network models are currently not competitive with sparse linear and tree-based methods in terms of accuracy. Most prior works attribute this poor performance to the low-dimensional bottleneck in embedding-based methods. In this paper, we demonstrate that theoretically there is no limitation to using low-dimensional embedding-based methods, and provide experimental evidence that overfitting is the root cause of the poor performance of embedding-based methods. These findings motivate us to investigate novel data augmentation and regularization techniques to mitigate overfitting. To this end, we propose GLaS, a new regularizer for embedding-based neural network approaches.

embedding-based classifier, embedding-based method, glass ceiling, (3 more...)

Neural Information Processing Systems

May-27-2025, 12:26:39 GMT

Conferences Web Page

Add feedback

Industry:
- Law > Civil Rights & Constitutional Law (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.89)