An overview of gradient descent optimization algorithms

Nov-23-2017, 22:30:19 GMT–@machinelearnbot

Note: If you are looking for a review paper, this blog post is also available as an article on arXiv. Added derivations of AdaMax and Nadam. Gradient descent is one of the most popular algorithms to perform optimization and by far the most common way to optimize neural networks. At the same time, every state-of-the-art Deep Learning library contains implementations of various algorithms to optimize gradient descent (e.g. These algorithms, however, are often used as black-box optimizers, as practical explanations of their strengths and weaknesses are hard to come by.

gradient, neural network, survey article, (17 more...)

@machinelearnbot

Nov-23-2017, 22:30:19 GMT

News Web Page

Add feedback

Genre:
- Overview (0.84)

Industry:
- Education (0.95)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)