Amortized variance reduction for doubly stochastic objectives

Boustati, Ayman, Vakili, Sattar, Hensman, James, John, ST

Mar-9-2020–arXiv.org Machine Learning

Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of convergence. Control variates can be used to reduce the variance, but past approaches do not take into account how mini-batch stochasticity affects sampling stochasticity, resulting in sub-optimal variance reduction. We propose a new approach in which we use a recognition network to cheaply approximate the optimal control variate for each mini-batch, with no additional model gradient computations. We illustrate the properties of this proposal and test its performance on logistic regression and deep Gaussian processes.

control variate, objective, variate, (11 more...)

arXiv.org Machine Learning

Mar-9-2020

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > New York
    - New York County > New York City (0.04)
  - Canada > Alberta
    - Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - West Midlands > Coventry (0.04)
  - Iceland > Capital Region
    - Reykjavik (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > New Finding (0.37)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.86)
  - Machine Learning
    - Neural Networks (1.00)
    - Statistical Learning > Regression (0.37)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found