Long-Tail Theory under Gaussian Mixtures
Bolatov, Arman, Tezekbayev, Maxat, Melnykov, Igor, Pak, Artur, Nikoulina, Vassilina, Assylbekov, Zhenisbek
–arXiv.org Artificial Intelligence
We suggest a simple Gaussian mixture model for data generation that complies with Feldman's long tail theory (2020). We demonstrate that a linear classifier cannot decrease the generalization error below a certain level in the proposed model, whereas a nonlinear classifier with a memorization capacity can. This confirms that for long-tailed distributions, rare training examples must be considered for optimal generalization to new data. Finally, we show that the performance gap between linear and nonlinear models can be lessened as the tail becomes shorter in the subpopulation frequency distribution, as confirmed by experiments on synthetic and real data.
arXiv.org Artificial Intelligence
Jul-24-2023
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- North America
- United States
- Washington > King County
- Seattle (0.04)
- Minnesota
- St. Louis County > Duluth (0.14)
- Saint Louis County > Duluth (0.14)
- Indiana > Allen County
- Fort Wayne (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Washington > King County
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- France (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- Middle East > Jordan (0.04)
- Kazakhstan > Akmola Region
- Astana (0.04)
- South America > Colombia
- Genre:
- Research Report > New Finding (0.93)
- Technology: