Differentiable Annealed Importance Sampling and the Perils of Gradient Noise

Neural Information Processing Systems 

As a further advantage, DAIS allows for mini-batch gradients.