Why Big Is Not Always Better In Machine Learning
Modern neural networks are often trained until they fit the training data exactly. Classically, such models would be dismissed as over-fitted, and yet they routinely achieve high accuracy on test data. It is counter-intuitive, but it works. This has raised many eyebrows, especially about the mathematical foundations of machine learning and their relevance to practitioners. To address these contradictions, researchers at OpenAI, in their recent work, take a hard look at the widely held belief that bigger is always better. The paper attempts to reconcile the classical understanding and modern practice within a single, unified performance curve.
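The shape of that unified curve, often called "double descent", can be reproduced in a small experiment: test error first falls, then spikes near the point where the model has just enough capacity to interpolate the training data, and then falls again as capacity keeps growing. Below is a minimal sketch of that effect, not the paper's setup: it fits a minimum-norm least-squares readout on random ReLU features, where the sine target, noise level, training size, and feature counts are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_data(n, noise=0.3):
    # Hypothetical toy task: y = sin(2*pi*x) plus Gaussian noise.
    x = rng.uniform(-1, 1, size=(n, 1))
    y = np.sin(2 * np.pi * x[:, 0]) + noise * rng.normal(size=n)
    return x, y

def relu_features(x, W, b):
    # Fixed random features; only the linear readout is fitted.
    return np.maximum(x @ W + b, 0.0)

n_train = 30
x_test, y_test = make_data(2000, noise=0.0)  # noiseless test targets

for p in [2, 5, 10, 20, 25, 30, 35, 40, 60, 100, 200]:
    errs = []
    for _ in range(20):  # average over random draws to smooth the curve
        x_train, y_train = make_data(n_train)
        W = rng.normal(size=(1, p))
        b = rng.uniform(-1, 1, size=p)
        Phi = relu_features(x_train, W, b)
        # Minimum-norm least squares: once p >= n_train, the model
        # interpolates the training set exactly.
        w, *_ = np.linalg.lstsq(Phi, y_train, rcond=None)
        pred = relu_features(x_test, W, b) @ w
        errs.append(np.mean((pred - y_test) ** 2))
    print(f"p={p:4d}  mean test MSE={np.mean(errs):.3f}")
```

Run as written, the printed test error tends to dip, spike around p = 30 (the interpolation threshold, where the feature matrix is square and nearly singular), and then descend again at larger p, which is the counter-intuitive second descent the paper studies at the scale of deep networks.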