A 2021 Guide to improving CNNs-Optimizers: Adam vs SGD

Jun-21-2021, 07:05:20 GMT–#artificialintelligence

This is the third post in my series, A 2021 Guide to improving CNNs. An optimizer can be described as a mathematical function that modifies the weights of the network given the gradients and, depending on its formulation, additional information. Optimizers build on the idea of gradient descent, the greedy approach of iteratively decreasing the loss function by following the negative gradient. Such functions can be as simple as subtracting the gradients from the weights, or they can be much more complex. Better optimizers mainly aim to be faster and more efficient, but some are also known to generalize better (overfit less) than others.
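To make the "simple versus complex" contrast concrete, here is a minimal sketch in plain Python of the two update rules named in the title. It is an illustration, not code from the post. The toy quadratic loss, the function names (`sgd_step`, `adam_step`, `run`), the starting weights, and the learning rate of 0.1 are all assumptions picked for the example. The Adam step follows the commonly published formulation, with bias-corrected running averages of the gradient and the squared gradient.

```python
import math


def sgd_step(w, grad, lr=0.1):
    """Plain gradient descent: subtract the scaled gradient from each weight."""
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def adam_step(w, grad, state, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. `state` carries the extra information Adam needs:
    the step count `t` and the running moment estimates `m` and `v`."""
    state["t"] += 1
    t = state["t"]
    new_w = []
    for i, (wi, gi) in enumerate(zip(w, grad)):
        # Exponential moving averages of the gradient and the squared gradient.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * gi
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * gi * gi
        # Bias correction compensates for the zero initialization of m and v.
        m_hat = state["m"][i] / (1 - beta1 ** t)
        v_hat = state["v"][i] / (1 - beta2 ** t)
        new_w.append(wi - lr * m_hat / (math.sqrt(v_hat) + eps))
    return new_w


def loss(w):
    """Toy loss (an assumption for this sketch): sum of squared weights."""
    return sum(wi * wi for wi in w)


def loss_grad(w):
    """Gradient of the toy loss."""
    return [2.0 * wi for wi in w]


def run(optimizer, steps=200):
    """Minimize the toy loss from a fixed start with the chosen optimizer."""
    w = [1.0, -2.0]
    state = {"t": 0, "m": [0.0] * len(w), "v": [0.0] * len(w)}
    for _ in range(steps):
        g = loss_grad(w)
        w = sgd_step(w, g) if optimizer == "sgd" else adam_step(w, g, state)
    return w


if __name__ == "__main__":
    for name in ("sgd", "adam"):
        print(name, "final loss:", loss(run(name)))
```

Note that the SGD step needs only the current gradient, while the Adam step keeps per-weight state between calls. That state is the "additional information" referred to above. On a toy problem like this one both rules reduce the loss, so the sketch says nothing about which generalizes better on a real CNN.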

