Attention for Inference Compilation

Harvey, William, Munk, Andreas, Baydin, Atılım Güneş, Bergholm, Alexander, Wood, Frank

Oct-25-2019–arXiv.org Machine Learning

We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they can fail to model long-range dependencies between latent variables. To address this, we introduce an attention mechanism that attends to the most salient variables previously sampled in the execution of a probabilistic program. We demonstrate that the addition of attention allows the proposal distributions to better match the true posterior, enhancing inference about latent variables in simulators.

architecture, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

Oct-25-2019

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom (0.28)
- North America
  - Canada (0.30)
  - United States (0.46)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.96)
  - Representation & Reasoning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found