AITopics | Lindsten, Fredrik

Elements of Sequential Monte Carlo

Naesseth, Christian A., Lindsten, Fredrik, Schön, Thomas B.

arXiv.org Machine LearningMar-12-2019

A core problem in statistics and probabilistic machine learning is to compute probability distributions and expectations. This is the fundamental problem of Bayesian statistics and machine learning, which frames all inference as expectations with respect to the posterior distribution. The key challenge is to approximate these intractable expectations. In this tutorial, we review sequential Monte Carlo (SMC), a random-sampling-based class of methods for approximate inference. First, we explain the basics of SMC, discuss practical issues, and review theoretical results. We then examine two of the main user design choices: the proposal distributions and the so called intermediate target distributions. We review recent results on how variational inference and amortization can be used to learn efficient proposals and target distributions. Next, we discuss the SMC estimate of the normalizing constant, how this can be used for pseudo-marginal inference and inference evaluation. Throughout the tutorial we illustrate the use of SMC on various models commonly used in machine learning, such as stochastic recurrent neural networks, probabilistic graphical models, and probabilistic programs.

approximation, deep learning, neural network, (21 more...)

arXiv.org Machine Learning

1903.04797

Country:

Europe (0.67)
North America > United States (0.45)

Genre:

Research Report (0.81)
Overview (0.67)
Instructional Material > Course Syllabus & Notes (0.45)

Add feedback

Evaluating model calibration in classification

Vaicenavicius, Juozas, Widmann, David, Andersson, Carl, Lindsten, Fredrik, Roll, Jacob, Schön, Thomas B.

arXiv.org Machine LearningFeb-19-2019

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their ability to represent uncertainty about predictions. In safety-critical applications, it is pivotal for a model to possess an adequate sense of uncertainty, which for probabilistic classifiers translates into outputting probability distributions that are consistent with the empirical frequencies observed from realized outcomes. A classifier with such a property is called calibrated. In this work, we develop a general theoretical calibration evaluation framework grounded in probability theory, and point out subtleties present in model calibration evaluation that lead to refined interpretations of existing evaluation techniques. Lastly, we propose new ways to quantify and visualize miscalibration in probabilistic classification, including novel multidimensional reliability diagrams.

artificial intelligence, machine learning, prediction, (15 more...)

arXiv.org Machine Learning

1902.06977

Country:

Europe > Sweden (0.14)
Asia > Japan (0.14)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.34)

Add feedback

Constructing the Matrix Multilayer Perceptron and its Application to the VAE

Taghia, Jalil, Bånkestad, Maria, Lindsten, Fredrik, Schön, Thomas B.

arXiv.org Machine LearningFeb-4-2019

Like most learning algorithms, the multilayer perceptrons (MLP) is designed to learn a vector of parameters from data. However, in certain scenarios we are interested in learning structured parameters (predictions) in the form of symmetric positive definite matrices. Here, we introduce a variant of the MLP, referred to as the matrix MLP, that is specialized at learning symmetric positive definite matrices. We also present an application of the model within the context of the variational autoencoder (VAE). Our formulation of the VAE extends the vanilla formulation to the cases where the recognition and the generative networks can be from the parametric family of distributions with dense covariance matrices. Two specific examples are discussed in more detail: the dense covariance Gaussian and its generalization, the power exponential distribution. Our new developments are illustrated using both synthetic and real data.

artificial intelligence, matrix, neural network, (17 more...)

arXiv.org Machine Learning

1902.01182

Country: Europe > Sweden (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (1.00)

Add feedback

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Lindsten, Fredrik, Helske, Jouni, Vihola, Matti

arXiv.org Machine LearningJan-8-2019

Approximate inference in probabilistic graphical models (PGMs) can be grouped into deterministic methods and Monte-Carlo-based methods. The former can often provide accurate and rapid inferences, but are typically associated with biases that are hard to quantify. The latter enjoy asymptotic consistency, but can suffer from high computational costs. In this paper we present a way of bridging the gap between deterministic and stochastic inference. Specifically, we suggest an efficient sequential Monte Carlo (SMC) algorithm for PGMs which can leverage the output from deterministic inference methods. While generally applicable, we show explicitly how this can be done with loopy belief propagation, expectation propagation, and Laplace approximations. The resulting algorithm can be viewed as a post-correction of the biases associated with these methods and, indeed, numerical results show clear improvements over the baseline deterministic methods as well as over "plain" SMC.

algorithm, approximation, artificial intelligence, (16 more...)

arXiv.org Machine Learning

1901.02374

Country:

Europe > Sweden (0.28)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Lindsten, Fredrik, Helske, Jouni, Vihola, Matti

Neural Information Processing SystemsDec-31-2018

Approximate inference in probabilistic graphical models (PGMs) can be grouped into deterministic methods and Monte-Carlo-based methods. The former can often provide accurate and rapid inferences, but are typically associated with biases that are hard to quantify. The latter enjoy asymptotic consistency, but can suffer from high computational costs. In this paper we present a way of bridging the gap between deterministic and stochastic inference. Specifically, we suggest an efficient sequential Monte Carlo (SMC) algorithm for PGMs which can leverage the output from deterministic inference methods. While generally applicable, we show explicitly how this can be done with loopy belief propagation, expectation propagation, and Laplace approximations. The resulting algorithm can be viewed as a post-correction of the biases associated with these methods and, indeed, numerical results show clear improvements over the baseline deterministic methods as well as over "plain" SMC.

approximation, artificial intelligence, bayesian inference, (13 more...)

Neural Information Processing Systems

Country:

Europe > Sweden (0.28)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Lindsten, Fredrik, Helske, Jouni, Vihola, Matti

Neural Information Processing SystemsDec-31-2018

Approximate inference in probabilistic graphical models (PGMs) can be grouped into deterministic methods and Monte-Carlo-based methods. The former can often provide accurate and rapid inferences, but are typically associated with biases that are hard to quantify. The latter enjoy asymptotic consistency, but can suffer from high computational costs. In this paper we present a way of bridging the gap between deterministic and stochastic inference. Specifically, we suggest an efficient sequential Monte Carlo (SMC) algorithm for PGMs which can leverage the output from deterministic inference methods. While generally applicable, we show explicitly how this can be done with loopy belief propagation, expectation propagation, and Laplace approximations. The resulting algorithm can be viewed as a post-correction of the biases associated with these methods and, indeed, numerical results show clear improvements over the baseline deterministic methods as well as over "plain" SMC.

approximation, artificial intelligence, inference, (15 more...)

Neural Information Processing Systems

Country:

Europe > Sweden (0.28)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Learning dynamical systems with particle stochastic approximation EM

Svensson, Andreas, Lindsten, Fredrik

arXiv.org Machine LearningJun-25-2018

We present the particle stochastic approximation EM (PSAEM) algorithm for learning of dynamical systems. The method builds on the EM algorithm, an iterative procedure for maximum likelihood inference in latent variable models. By combining stochastic approximation EM and particle Gibbs with ancestor sampling (PGAS), PSAEM obtains superior computational performance and convergence properties compared to plain particle-smoothing-based approximations of the EM algorithm. PSAEM can be used for plain maximum likelihood inference as well as for empirical Bayes learning of hyperparameters. Specifically, the latter point means that existing PGAS implementations easily can be extended with PSAEM to estimate hyperparameters at almost no extra computational cost. We discuss the convergence properties of the algorithm, and demonstrate it on several machine learning applications.

bayesian inference, state-space model, survey article, (16 more...)

arXiv.org Machine Learning

1806.09548

Country:

North America > United States > New York (0.14)
North America > Canada > Ontario > Toronto (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Learning of state-space models with highly informative observations: a tempered Sequential Monte Carlo solution

Svensson, Andreas, Schön, Thomas B., Lindsten, Fredrik

arXiv.org Machine LearningDec-13-2017

Probabilistic (or Bayesian) modeling and learning offers interesting possibilities for systematic representation of uncertainty using probability theory. However, probabilistic learning often leads to computationally challenging problems. Some problems of this type that were previously intractable can now be solved on standard personal computers thanks to recent advances in Monte Carlo methods. In particular, for learning of unknown parameters in nonlinear state-space models, methods based on the particle filter (a Monte Carlo method) have proven very useful. A notoriously challenging problem, however, still occurs when the observations in the state-space model are highly informative, i.e. when there is very little or no measurement noise present, relative to the amount of process noise. The particle filter will then struggle in estimating one of the basic components for probabilistic learning, namely the likelihood $p($data$|$parameters$)$. To this end we suggest an algorithm which initially assumes that there is substantial amount of artificial measurement noise present. The variance of this noise is sequentially decreased in an adaptive fashion such that we, in the end, recover the original problem or possibly a very close approximation of it. The main component in our algorithm is a sequential Monte Carlo (SMC) sampler, which gives our proposed method a clear resemblance to the SMC^2 method. Another natural link is also made to the ideas underlying the approximate Bayesian computation (ABC). We illustrate it with numerical examples, and in particular show promising results for a challenging Wiener-Hammerstein benchmark problem.

bayesian inference, particle filter, survey article, (18 more...)

arXiv.org Machine Learning

doi: 10.1016/j.ymssp.2017.09.016

1702.01618

Country:

North America > United States (0.14)
Europe > Belgium (0.14)
Europe > Spain (0.14)
Asia > China (0.14)

Genre: Research Report (0.82)

Add feedback

Pseudo-extended Markov chain Monte Carlo

Nemeth, Christopher, Lindsten, Fredrik, Filippone, Maurizio, Hensman, James

arXiv.org Machine LearningAug-17-2017

Sampling from the posterior distribution using Markov chain Monte Carlo (MCMC) methods can require an exhaustive number of iterations to fully explore the correct posterior. This is often the case when the posterior of interest is multi-modal, as the MCMC sampler can become trapped in a local mode for a large number of iterations. In this paper, we introduce the pseudo-extended MCMC method as an approach for improving the mixing of the MCMC sampler in complex posterior distributions. The pseudo-extended method augments the state-space of the posterior using pseudo-samples as auxiliary variables, where on the extended space, the MCMC sampler is able to easily move between the well-separated modes of the posterior. We apply the pseudo-extended method within an Hamiltonian Monte Carlo sampler and show that by using the No U-turn algorithm (Hoffman and Gelman, 2014), our proposed sampler is completely tuning free. We compare the pseudo-extended method against well-known tempered MCMC algorithms and show the advantages of the new sampler on a number of challenging examples from the statistics literature.

algorithm, artificial intelligence, bayesian inference, (19 more...)

arXiv.org Machine Learning

1708.05239

Country:

North America > United States (0.46)
Europe (0.28)

Genre: Research Report (0.50)

Add feedback

Interacting Particle Markov Chain Monte Carlo

Rainforth, Tom, Naesseth, Christian A., Lindsten, Fredrik, Paige, Brooks, van de Meent, Jan-Willem, Doucet, Arnaud, Wood, Frank

arXiv.org Machine LearningApr-12-2017

We introduce interacting particle Markov chain Monte Carlo (iPMCMC), a PMCMC method based on an interacting pool of standard and conditional sequential Monte Carlo samplers. Like related methods, iPMCMC is a Markov chain Monte Carlo sampler on an extended space. We present empirical results that show significant improvements in mixing rates relative to both non-interacting PMCMC samplers, and a single PMCMC sampler with an equivalent memory and computational budget. An additional advantage of the iPMCMC method is that it is suitable for distributed and multi-core architectures.

artificial intelligence, ipmcmc, machine learning, (15 more...)

arXiv.org Machine Learning

1602.05128

Country:

Europe > Sweden (0.14)
Europe > France (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Filters

Collaborating Authors

Lindsten, Fredrik

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Elements of Sequential Monte Carlo

Evaluating model calibration in classification

Constructing the Matrix Multilayer Perceptron and its Application to the VAE

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Graphical model inference: Sequential Monte Carlo meets deterministic approximations

Learning dynamical systems with particle stochastic approximation EM

Learning of state-space models with highly informative observations: a tempered Sequential Monte Carlo solution

Pseudo-extended Markov chain Monte Carlo

Interacting Particle Markov Chain Monte Carlo