A Variant of Gradient Descent Algorithm Based on Gradient Averaging

Purkayastha, Saugata, Purkayastha, Sukannya

Dec-10-2020–arXiv.org Machine Learning

In this work, we study an optimizer, Grad-Avg, for minimizing error functions. We establish mathematically that the sequence of iterates of Grad-Avg converges to a minimizer (under a boundedness assumption). We apply Grad-Avg, along with some popular optimizers, to regression as well as classification tasks. In regression tasks, the behaviour of Grad-Avg is observed to be almost identical to that of Stochastic Gradient Descent (SGD), and we present a mathematical justification of this fact. In classification tasks, the performance of Grad-Avg can be enhanced by suitably scaling the parameters. Experimental results demonstrate that Grad-Avg converges faster than the other state-of-the-art optimizers for the classification task on two benchmark datasets.
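
The abstract does not state the Grad-Avg update rule, so the following is only a minimal sketch of one plausible reading of "gradient averaging". It assumes that each step averages the gradient at the current iterate with the gradient at a tentative plain gradient-descent step (a Heun-style update). The function names `grad_avg_step` and `gd_step`, the synthetic least-squares problem, and the learning rate are all hypothetical choices made for illustration, not taken from the paper.

```python
import numpy as np

def grad_avg_step(x, grad, lr):
    """One step of an ASSUMED gradient-averaging update (not confirmed by the abstract)."""
    g_now = grad(x)                  # gradient at the current iterate
    x_look = x - lr * g_now          # tentative plain gradient-descent step
    g_look = grad(x_look)            # gradient at the tentative point
    return x - lr * 0.5 * (g_now + g_look)  # move along the averaged gradient

def gd_step(x, grad, lr):
    """Plain (full-batch) gradient-descent step, used here as a baseline."""
    return x - lr * grad(x)

# Synthetic, noise-free linear regression problem (illustrative data only).
rng = np.random.default_rng(0)
A = rng.normal(size=(50, 3))
w_true = np.array([1.0, -2.0, 0.5])
b = A @ w_true

def grad(w):
    # Gradient of the mean squared error 0.5 * ||A w - b||^2 / n.
    return A.T @ (A @ w - b) / len(b)

w_avg = np.zeros(3)
w_gd = np.zeros(3)
for _ in range(500):
    w_avg = grad_avg_step(w_avg, grad, lr=0.1)
    w_gd = gd_step(w_gd, grad, lr=0.1)

print("assumed Grad-Avg:", w_avg)
print("gradient descent:", w_gd)
```

On this small regression problem both updates recover the same weights, which is in line with the abstract's remark that the two methods behave almost identically on regression tasks. The sketch uses full-batch gradients for simplicity and says nothing about the classification results or the parameter scaling mentioned above.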

Country:
- North America > United States
  - New York (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - India > West Bengal
    - Kharagpur (0.04)

Genre:
- Research Report (0.70)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Statistical Learning > Gradient Descent (0.93)
