A Deeper Understanding of Deep Learning

May-21-2022, 06:10:18 GMT–Communications of the ACM

Deep learning should not work as well as it seems to: according to traditional statistics and machine learning, any analysis that has too many adjustable parameters will overfit noisy training data, and then fail when faced with novel test data. In clear violation of this principle, modern neural networks often use vastly more parameters than data points, but they nonetheless generalize to new data quite well. The shaky theoretical basis for generalization has been noted for many years. One proposal was that neural networks implicitly perform some sort of regularization--a statistical tool that penalizes the use of extra parameters. Yet efforts to formally characterize such an "implicit bias" toward smoother solutions have failed, said Roi Livni, an advanced lecturer in the department of electrical engineering of Israel's Tel Aviv University.

generalization, learning, neural network, (15 more...)

Communications of the ACM

May-21-2022, 06:10:18 GMT

Journals Web Page

Add feedback

Country:
- North America
  - United States
    - Massachusetts > Suffolk County
      - Boston (0.05)
    - California > San Diego County
      - San Diego (0.05)
  - Canada > Ontario
    - Toronto (0.05)
- Europe > Switzerland
  - Vaud > Lausanne (0.05)
- Asia > Middle East
  - Israel > Tel Aviv District > Tel Aviv (0.25)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found