Why Does Deep Learning Not Have a Local Minimum?
Editor's note: This post originally appeared as an answer to a Quora question, which also included the following: "As I understand, the chance of having a derivative be zero in each of the thousands of directions is low. Is there some other reason besides this?"

Yes, there is a 'theoretical justification', and it has taken a couple of decades to flesh it out. I will first point out, however, that this has been observed in practice. It was noted by LeCun in his early work on LeNet, and is actually discussed in the 'orange book', "Pattern Classification" by Richard O. Duda, Peter E. Hart, and David G. Stork. The problem was addressed in condensed matter physics 20 years ago in the study of spin glasses.
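The intuition in the question can be illustrated numerically. A critical point is a local minimum only if the Hessian there has all-positive eigenvalues; if, in the spirit of the spin-glass analysis, we model the Hessian at a random critical point as a random symmetric matrix, the fraction of such matrices that are positive definite collapses rapidly with dimension, so high-dimensional critical points are overwhelmingly saddle points. This is only an illustrative sketch under that random-matrix assumption, and `frac_all_positive` is a helper name of my own, not from the original post:

```python
import numpy as np

rng = np.random.default_rng(0)

def frac_all_positive(n, trials=2000):
    """Fraction of random n x n symmetric Gaussian matrices (a stand-in
    for Hessians at random critical points) whose eigenvalues are all
    positive, i.e. that would correspond to local minima."""
    count = 0
    for _ in range(trials):
        a = rng.standard_normal((n, n))
        h = (a + a.T) / 2.0  # symmetrize to get a valid Hessian model
        if np.all(np.linalg.eigvalsh(h) > 0):
            count += 1
    return count / trials

for n in (1, 2, 4, 8):
    print(n, frac_all_positive(n))
```

For n = 1 roughly half of the samples are "minima", but by n = 8 essentially none are; with thousands of directions, a random critical point is a minimum with vanishing probability, which is the phenomenon the spin-glass literature makes precise.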
Jun-2-2017, 18:50:08 GMT