On Markov Chain Gradient Descent

Feb-14-2020, 20:56:42 GMT–Neural Information Processing Systems

Stochastic gradient methods are the workhorse algorithms for large-scale optimization problems in machine learning, signal processing, and other computational sciences and engineering. This paper studies Markov chain gradient descent, a variant of stochastic gradient descent in which the random samples are taken along the trajectory of a Markov chain. Existing results for this method assume convex objectives and a reversible Markov chain, which limits their scope. We establish new non-ergodic convergence results under a wider range of step sizes, for nonconvex problems, and for non-reversible finite-state Markov chains. Handling nonconvexity makes our method applicable to broader problem classes.
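To make the sampling scheme concrete, here is a minimal Python sketch of the idea described in the abstract: a gradient step is taken on the component function indexed by the current state of a finite-state Markov chain, and the chain then moves one step. Everything specific in it is an illustrative assumption rather than something taken from the paper: the function names `mcgd` and `biased_cycle`, the toy scalar objective, the biased walk on a cycle (chosen because it is non-reversible yet has a uniform stationary distribution), and the diminishing step size `1 / k**q`.

```python
import random


def biased_cycle(n, forward=0.6, backward=0.2):
    """Transition matrix of a lazy, biased random walk on an n-cycle.

    The matrix is doubly stochastic, so the stationary distribution is
    uniform, but forward != backward makes the chain non-reversible.
    (Hypothetical helper, used only for this illustration.)
    """
    stay = 1.0 - forward - backward
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        P[i][(i + 1) % n] += forward
        P[i][(i - 1) % n] += backward
        P[i][i] += stay
    return P


def mcgd(grads, transition, x0, num_steps, q=0.75, seed=0):
    """Sketch of Markov chain gradient descent on f(x) = (1/n) * sum_i f_i(x).

    grads[i](x) returns the gradient of f_i at x (a scalar here).
    transition[i] is row i of the chain's transition matrix.
    The step size 1 / k**q is an assumed diminishing schedule.
    """
    rng = random.Random(seed)
    n = len(grads)
    state = rng.randrange(n)  # arbitrary initial state
    x = x0
    for k in range(1, num_steps + 1):
        step = 1.0 / k ** q
        # Gradient step on the component selected by the current state.
        x -= step * grads[state](x)
        # Advance the chain: the next sample depends on the current one.
        state = rng.choices(range(n), weights=transition[state])[0]
    return x


# Toy objective: f_i(x) = 0.5 * (x - c_i)**2, minimized at the mean of c.
targets = [1.0, 2.0, 3.0, 4.0, 5.0]
grads = [lambda x, c=c: x - c for c in targets]
P = biased_cycle(len(targets))
x_hat = mcgd(grads, P, x0=0.0, num_steps=20000)
print(f"estimate: {x_hat:.3f}, minimizer: {sum(targets) / len(targets):.3f}")
```

Unlike standard stochastic gradient descent, consecutive samples here are correlated, since each index is drawn conditionally on the previous one rather than independently. In this toy setup the iterate should still settle near the mean of `targets`, because the chain visits every state equally often in the long run.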

finite-state markov chain, markov chain, markov chain gradient descent

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (1.00)