Rao-Blackwellized Stochastic Gradients for Discrete Distributions

Liu, Runjing, Regier, Jeffrey, Tripuraneni, Nilesh, Jordan, Michael I., McAuliffe, Jon

Oct-10-2018–arXiv.org Machine Learning

We wish to compute the gradient of an expectation over a finite or countably infinite sample space having $K \leq \infty$ categories. When $K$ is indeed infinite, or finite but very large, the relevant summation is intractable. Accordingly, various stochastic gradient estimators have been proposed. In this paper, we describe a technique that can be applied to reduce the variance of any such estimator, without changing its bias---in particular, unbiasedness is retained. We show that our technique is an instance of Rao-Blackwellization, and we demonstrate the improvement it yields in empirical studies on both synthetic and real-world data.

artificial intelligence, estimator, machine learning, (16 more...)

arXiv.org Machine Learning

Oct-10-2018

arXiv.org PDF

Add feedback

Country:
- North America > United States > California (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Neural Networks (0.69)
    - Statistical Learning > Gradient Descent (0.63)
  - Representation & Reasoning > Mathematical & Statistical Methods (0.63)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found