When and Why Are Deep Networks Better Than Shallow Ones?

Mhaskar, Hrushikesh (California Institute of Technology) | Liao, Qianli (Massachusetts Institute of Technology) | Poggio, Tomaso (Massachusetts Institute of Technology)

Feb-14-2017–AAAI Conferences

While the universal approximation property holds for both hierarchical and shallow networks, deep networks can approximate the class of compositional functions as well as shallow networks but with an exponentially smaller number of training parameters and exponentially lower sample complexity. Compositional functions are obtained as a hierarchy of local constituent functions, where "local functions" are functions with low dimensionality. This theorem proves an old conjecture by Bengio on the role of depth in networks, characterizing precisely the conditions under which it holds. It also suggests possible answers to the puzzle of why high-dimensional deep networks trained on large training sets often do not seem to overfit.
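To make the notion of a compositional function concrete, the following Python sketch builds a function of n variables as a binary tree of two-input constituent functions, so every constituent has dimensionality 2 regardless of n. The names compose_binary_tree, count_constituents, shallow_units and deep_units are hypothetical and chosen only for this illustration. The two unit-count helpers rest on an assumption that is not stated in the abstract: that approximating a d-dimensional function of smoothness m to accuracy eps takes on the order of eps^(-d/m) units, with all constants ignored. Under that assumption a shallow network faces dimension n, while a deep network matching the tree faces dimension 2 at each of its n - 1 nodes.

```python
from typing import Callable, Sequence


def compose_binary_tree(h: Callable[[float, float], float],
                        xs: Sequence[float]) -> float:
    """Evaluate a compositional function shaped like a binary tree.

    The two-input constituent h is applied to adjacent pairs, level by
    level, until a single value remains. len(xs) must be a power of two.
    """
    level = list(xs)
    n = len(level)
    if n == 0 or n & (n - 1) != 0:
        raise ValueError("number of inputs must be a power of two")
    while len(level) > 1:
        level = [h(level[i], level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]


def count_constituents(n: int) -> int:
    """A full binary tree with n leaves has n - 1 internal nodes."""
    return n - 1


def shallow_units(n: int, eps: float, m: float) -> float:
    """Illustrative scaling (assumption): one n-dimensional approximation."""
    return eps ** (-n / m)


def deep_units(n: int, eps: float, m: float) -> float:
    """Illustrative scaling (assumption): n - 1 two-dimensional approximations."""
    return count_constituents(n) * eps ** (-2 / m)


if __name__ == "__main__":
    # A toy constituent; any function of two variables would do here.
    value = compose_binary_tree(lambda a, b: a + b, [1, 2, 3, 4, 5, 6, 7, 8])
    print("value:", value)
    print("constituents:", count_constituents(8))
    print("shallow units (illustrative):", shallow_units(8, 0.1, 2))
    print("deep units (illustrative):", deep_units(8, 0.1, 2))
```

The printed unit counts are only meant to show how the gap between the two scalings widens as n grows; they are not figures reported by the authors.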

deep learning, deep network, neural network

Country:
- North America > United States > California > Los Angeles County (0.14)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
