mean-field approximation
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
On the detrimental effect of invariances in the likelihood for variational inference
Variational Bayesian posterior inference often requires simplifying approximations such as mean-field parametrisation to ensure tractability. However, prior work has associated the variational mean-field approximation for Bayesian neural networks with underfitting in the case of small datasets or large model sizes. In this work, we show that invariances in the likelihood function of over-parametrised models contribute to this phenomenon because these invariances complicate the structure of the posterior by introducing discrete and/or continuous modes which cannot be well approximated by Gaussian mean-field distributions. In particular, we show that the mean-field approximation has an additional gap in the evidence lower bound compared to a purpose-built posterior that takes into account the known invariances. Importantly, this invariance gap is not constant; it vanishes as the approximation reverts to the prior. We proceed by first considering translation invariances in a linear model with a single data point in detail. We show that, while the true posterior can be constructed from a mean-field parametrisation, this is achieved only if the objective function takes into account the invariance gap. Then, we transfer our analysis of the linear model to neural networks. Our analysis provides a framework for future work to explore solutions to the invariance problem.
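To make the mechanism in the abstract concrete, here is a minimal sketch (my own illustration, not the paper's invariance-gap construction) of the simplest setting it mentions: a linear model with a single data point y = w1 + w2 + noise, whose likelihood is invariant to translations (w1 + c, w2 − c). The invariance makes the exact posterior a strongly correlated Gaussian, so the best diagonal (mean-field) Gaussian leaves a strictly positive ELBO gap. The values of y, the noise variance, and the use of scipy's optimizer are illustrative assumptions.

```python
# Sketch: mean-field ELBO gap under a translation-invariant likelihood.
import numpy as np
from scipy.optimize import minimize

y, sigma2 = 1.0, 0.1          # single observation and noise variance (illustrative)
a = np.array([1.0, 1.0])      # likelihood depends on w only through a @ w = w1 + w2

# Exact log-evidence: y ~ N(0, sigma2 + a @ a) under the N(0, I) prior.
log_evidence = -0.5 * (np.log(2 * np.pi * (sigma2 + a @ a)) + y**2 / (sigma2 + a @ a))

def neg_elbo(params):
    mu, log_s = params[:2], params[2:]
    s2 = np.exp(2 * log_s)
    # E_q[log N(y; w1 + w2, sigma2)] in closed form for diagonal Gaussian q.
    m, v = a @ mu, a**2 @ s2
    exp_loglik = -0.5 * (np.log(2 * np.pi * sigma2) + ((y - m)**2 + v) / sigma2)
    # KL(q || N(0, I)) for a diagonal Gaussian q.
    kl = 0.5 * np.sum(s2 + mu**2 - 1.0 - 2 * log_s)
    return -(exp_loglik - kl)

res = minimize(neg_elbo, np.zeros(4))
print(f"log p(y)                   = {log_evidence:.4f}")
print(f"best mean-field ELBO       = {-res.fun:.4f}")
print(f"gap (KL to true posterior) = {log_evidence + res.fun:.4f}")
```

Because the likelihood only constrains w1 + w2, the exact posterior correlates w1 and w2 (here with correlation −10/11), which no diagonal Gaussian can represent; the printed gap is exactly the KL divergence from the best mean-field fit to the true posterior.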
aa1f5f73327ba40d47ebce155e785aaf-AuthorFeedback.pdf
We would like to thank all the reviewers for their thoughtful comments and their enthusiasm for our work. These results are consistent with those of Zoltowski et al. [2020], where they found Laplace EM … (Section 3). Segmenting the continuous latent states for each population is equivalent to imposing hard constraints … On top of that, the "sticky" parameterization of discrete state transitions reveals which neural populations … C. elegans offers an illustrative demonstration of the mp-srSLDS; for example, we explore interactions between ganglia in Appendix C. Thanks again for spending the time to provide valuable feedback on our work.
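For readers unfamiliar with the "sticky" parameterization mentioned in the feedback above, the sketch below (my own illustration, not the authors' code) shows the usual idea behind sticky transition models: extra mass kappa on the diagonal of the transition matrix encourages discrete states to persist. The function name and default value are assumptions.

```python
# Sketch of a "sticky" discrete-state transition parameterization:
# boosted self-transitions make state sequences change less often.
import numpy as np

def sticky_transitions(logits, kappa=5.0):
    """Row-normalized transition matrix with extra self-transition mass kappa."""
    A = np.exp(logits) + kappa * np.eye(logits.shape[0])
    return A / A.sum(axis=1, keepdims=True)

print(sticky_transitions(np.zeros((3, 3))))  # self-transition prob 6/8 = 0.75
```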
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Copula variational inference
Dustin Tran, David Blei, Edo M. Airoldi
We develop a general variational inference method that preserves dependency among the latent variables. Our method uses copulas to augment the families of distributions used in mean-field and structured approximations. Copulas model the dependency that is not captured by the original variational distribution, and thus the augmented variational family guarantees better approximations to the posterior. With stochastic optimization, inference on the augmented distribution is scalable. Furthermore, our strategy is generic: it can be applied to any inference procedure that currently uses the mean-field or structured approach. Copula variational inference has many advantages: it reduces bias; it is less sensitive to local optima; it is less sensitive to hyperparameters; and it helps characterize and interpret the dependency among the latent variables.
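As a concrete illustration of the augmentation step the abstract describes, the following sketch (illustrative names and values, not the authors' implementation) keeps the marginals of a mean-field Gaussian approximation and reintroduces dependence between the latent variables through a Gaussian copula with correlation parameter rho; in copula variational inference, copula parameters like rho would be optimized alongside the marginal parameters.

```python
# Sketch: sampling from a mean-field approximation augmented with a Gaussian copula.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

# Mean-field marginals q_i(z_i) = N(mu_i, s_i^2) (illustrative values).
mu = np.array([0.0, 1.0])
s = np.array([1.0, 0.5])

# Gaussian copula with correlation rho between the two latent variables.
rho = -0.9
C = np.array([[1.0, rho], [rho, 1.0]])
L = np.linalg.cholesky(C)

# Sample: correlated standard normals -> uniforms (the copula) -> marginal quantiles.
eps = rng.standard_normal((5000, 2))
u = norm.cdf(eps @ L.T)        # uniforms carrying the copula's dependence
z = mu + s * norm.ppf(u)       # push through each marginal's inverse CDF

print("empirical marginal means:", z.mean(axis=0))         # close to mu
print("empirical correlation:   ", np.corrcoef(z.T)[0, 1])  # close to rho
```

The marginals of z match the original mean-field factors by construction; only the joint dependence changes, which is why the augmented family can never approximate the posterior worse than the mean-field family it starts from.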