Greedy Layer-Wise Training of Deep Networks
Yoshua Bengio, Pascal Lamblin, Dan Popovici, Hugo Larochelle
Complexity theory of circuits strongly suggests that deep architectures can be much more efficient (sometimes exponentially) than shallow architectures, in terms of the computational elements required to represent some functions. Deep multi-layer neural networks have many levels of non-linearities, allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization appears to often get stuck in poor solutions.
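The training strategy named in the title is to pretrain the network one layer at a time in an unsupervised way, each layer learning to model the representation produced by the layer below, and then fine-tune the whole stack. Below is a minimal NumPy sketch of that greedy layer-wise scheme, assuming tied-weight sigmoid autoencoders trained by plain gradient descent; the helper names (`train_autoencoder`, `greedy_pretrain`) and all hyperparameters are illustrative, not the paper's exact setup, which studies RBM-based and autoencoder-based variants followed by supervised fine-tuning.

```python
# Illustrative sketch of greedy layer-wise unsupervised pretraining,
# assuming tied-weight sigmoid autoencoders (not the paper's exact models).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_autoencoder(X, n_hidden, lr=0.1, epochs=50):
    """Train one autoencoder layer on inputs X; return its encoder parameters."""
    n_in = X.shape[1]
    W = rng.normal(0.0, 0.1, size=(n_in, n_hidden))  # encoder weights (tied: decoder uses W.T)
    b = np.zeros(n_hidden)                           # encoder bias
    c = np.zeros(n_in)                               # decoder bias
    for _ in range(epochs):
        H = sigmoid(X @ W + b)        # hidden code
        R = sigmoid(H @ W.T + c)      # reconstruction of the input
        # Gradients of the cross-entropy reconstruction loss with tied weights.
        dR = (R - X) / X.shape[0]     # grad w.r.t. decoder pre-activation
        dH = (dR @ W) * H * (1 - H)   # backprop through the hidden code
        W -= lr * (X.T @ dH + dR.T @ H)  # both uses of the tied weight matrix
        b -= lr * dH.sum(axis=0)
        c -= lr * dR.sum(axis=0)
    return W, b

def greedy_pretrain(X, layer_sizes):
    """Train layers one at a time, each on the previous layer's representation."""
    params, H = [], X
    for n_hidden in layer_sizes:
        W, b = train_autoencoder(H, n_hidden)
        params.append((W, b))
        H = sigmoid(H @ W + b)  # freeze this layer; its code is the next layer's input
    return params               # used to initialize a deep net before fine-tuning

# Toy usage: pretrain a 20-16-8 stack on random binary data.
X = (rng.random((256, 20)) > 0.5).astype(float)
params = greedy_pretrain(X, layer_sizes=[16, 8])
print([W.shape for W, _ in params])  # [(20, 16), (16, 8)]
```

The point of the greedy scheme is that each unsupervised layer gives the next one a better-conditioned starting point than random initialization, which is the failure mode the abstract describes.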
Neural Information Processing Systems
December 31, 2007