Adam - momentum y (aka. cost) terms. • r/MachineLearning

Dec-27-2017, 09:36:29 GMT–#artificialintelligence

It was the Newton-Raphson method for finding roots of an equation. I thought this method mostly applies for minimization in machine learning as cost is always defined as a positive real valued function. But it was pointed out to me that the update equation of newton-raphson method, which is x x - y / dy_dx, is unstable at local minimas (where dy_dx 0) since it makes the update burst to infinity. Eventually, I landed on this update equation, x x - ((y * dy_dx) / (y dy_dx2)); dy_dx derivative of y wrt. To relate this update equation with the title: if we consider the update portion of the equation - g(x, y) (y * x) / (y x2); y 0 It is quite similar to adam since there is a square gradient term in the denominator and the gradient term in the numerator.

artificial intelligence, equation, social media, (8 more...)

#artificialintelligence

Dec-27-2017, 09:36:29 GMT

News Web Page

Add feedback

Industry:
- Media > News (0.42)
- Information Technology > Smart Houses & Appliances (0.40)

Technology:
- Information Technology
  - Communications > Social Media (0.81)
  - Artificial Intelligence (0.60)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found