Tuning Stochastic Gradient Algorithms for Statistical Inference via Large-Sample Asymptotics

Negrea, Jeffrey; Yang, Jun; Feng, Haoyue; Roy, Daniel M.; Huggins, Jonathan H.

Jul-20-2023–arXiv.org Artificial Intelligence

The tuning of stochastic gradient algorithms (SGAs) for optimization and sampling is often based on heuristics and trial-and-error rather than generalizable theory. We address this theory–practice gap by characterizing the large-sample statistical asymptotics of SGAs via a joint step-size–sample-size scaling limit. We show that iterate averaging with a large fixed step size is robust to the choice of tuning parameters and asymptotically has covariance proportional to that of the MLE sampling distribution. We also prove a Bernstein–von Mises-like theorem to guide tuning, including for generalized posteriors that are robust to model misspecification. Numerical experiments validate our results and recommendations in realistic finite-sample regimes. Our work lays the foundation for a systematic analysis of other stochastic gradient Markov chain Monte Carlo algorithms for a wide range of models.
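As a rough illustration of what "iterate averaging with a large fixed step size" can look like, the sketch below runs constant-step SGD on a toy Gaussian mean model and averages the iterates after a burn-in. It is not the authors' algorithm or experimental setup: the model, the function name `averaged_sgd_mean`, the step size, the burn-in length, and the iteration count are all assumptions chosen for this example.

```python
import random


def averaged_sgd_mean(data, step_size=0.5, num_iters=20000, burn_in=1000, seed=0):
    """Constant-step SGD on the per-sample loss 0.5 * (theta - x)^2.

    Returns the average of the iterates after `burn_in` steps.
    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    theta = 0.0
    running_sum = 0.0
    kept = 0
    for k in range(num_iters):
        x = rng.choice(data)       # minibatch of size one, drawn with replacement
        grad = theta - x           # gradient of 0.5 * (theta - x)^2 in theta
        theta -= step_size * grad  # fixed (non-decaying) step size
        if k >= burn_in:
            running_sum += theta
            kept += 1
    return running_sum / kept


# Synthetic data from N(2, 1); for this model the MLE of the mean is the sample mean.
data_rng = random.Random(1)
data = [data_rng.gauss(2.0, 1.0) for _ in range(2000)]
sample_mean = sum(data) / len(data)
estimate = averaged_sgd_mean(data)

print(f"sample mean (MLE): {sample_mean:.4f}")
print(f"averaged SGD:      {estimate:.4f}")
```

In this toy setting the individual iterates keep fluctuating because the step size never decays, while their average settles near the sample mean. Re-running with a different `step_size` in (0, 1] is a simple way to probe how sensitive the averaged estimate is to that choice.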


