Conditional independence testing under misspecified inductive biases

Neural Information Processing Systems

Conditional independence (CI) testing is a fundamental and challenging task in modern statistics and machine learning. Many modern methods for CI testing rely on powerful supervised learning methods to learn regression functions or Bayes predictors as an intermediate step; we refer to this class of tests as regression-based tests. Although these methods are guaranteed to control Type-I error when the supervised learning methods accurately estimate the regression functions or Bayes predictors of interest, their behavior is less understood when they fail due to misspecified inductive biases, that is, when the employed models are not flexible enough or when the training algorithm does not induce the desired predictors. In this work, we study the performance of regression-based CI tests under misspecified inductive biases. Namely, we propose new approximations or upper bounds for the testing errors of three regression-based tests that depend on misspecification errors. Moreover, we introduce the Rao-Blackwellized Predictor Test (RBPT), a regression-based CI test robust against misspecified inductive biases. Finally, we conduct experiments with artificial and real data, showcasing the usefulness of our theory and methods.
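A regression-based CI test of the kind the abstract describes can be sketched as follows (a minimal illustration in the spirit of residual-based tests such as the generalized covariance measure, not the paper's RBPT; `fit_predict` and all variable names are our own placeholders): regress X on Z and Y on Z with any supervised learner, then test whether the residuals are uncorrelated.

```python
import numpy as np

def regression_ci_test(x, y, z, fit_predict):
    """Residual-based CI test sketch for H0: X independent of Y given Z.

    `fit_predict(features, target)` is a user-supplied regression routine
    returning in-sample predictions; misspecified inductive biases in this
    step are exactly what can break Type-I error control.
    """
    rx = x - fit_predict(z, x)          # residual of X after regressing on Z
    ry = y - fit_predict(z, y)          # residual of Y after regressing on Z
    prod = rx * ry
    n = len(prod)
    # Normalized statistic; approximately N(0, 1) under H0 when the
    # regression functions are estimated accurately.
    return np.sqrt(n) * prod.mean() / (prod.std() + 1e-12)

def linear_fit_predict(features, target):
    """Ordinary least squares with an intercept (one example learner)."""
    A = np.column_stack([np.ones(len(features)), features])
    coef, *_ = np.linalg.lstsq(A, target, rcond=None)
    return A @ coef
```

Large absolute values of the statistic lead to rejecting conditional independence; the choice of `fit_predict` encodes the inductive bias whose misspecification the paper analyzes.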


In-Context Learning Is Provably Bayesian Inference: A Generalization Theory for Meta-Learning

Wakayama, Tomoya, Suzuki, Taiji

arXiv.org Machine Learning

This paper develops a finite-sample statistical theory for in-context learning (ICL), analyzed within a meta-learning framework that accommodates mixtures of diverse task types. We introduce a principled risk decomposition that separates the total ICL risk into two orthogonal components: Bayes Gap and Posterior Variance. The Bayes Gap quantifies how well the trained model approximates the Bayes-optimal in-context predictor. For a uniform-attention Transformer, we derive a non-asymptotic upper bound on this gap, which explicitly clarifies the dependence on the number of pretraining prompts and their context length. The Posterior Variance is a model-independent risk representing the intrinsic task uncertainty. Our key finding is that this term is determined solely by the difficulty of the true underlying task, while the uncertainty arising from the task mixture vanishes exponentially fast with only a few in-context examples. Together, these results provide a unified view of ICL: the Transformer selects the optimal meta-algorithm during pretraining and rapidly converges to the optimal algorithm for the true task at test time.
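The risk decomposition described above can be written schematically as follows (notation is ours, chosen for illustration rather than taken verbatim from the paper): with \(\mathcal{R}(\cdot)\) the ICL risk, \(\hat{f}\) the trained Transformer's in-context predictor, and \(f^{\mathrm{Bayes}}\) the Bayes-optimal in-context predictor,

```latex
\mathcal{R}(\hat{f})
  = \underbrace{\mathcal{R}(\hat{f}) - \mathcal{R}(f^{\mathrm{Bayes}})}_{\text{Bayes Gap}}
  \;+\; \underbrace{\mathcal{R}(f^{\mathrm{Bayes}})}_{\text{Posterior Variance}}
```

The first term is model-dependent and bounded non-asymptotically in the paper; the second is the model-independent, intrinsic task uncertainty.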


Supplementary materials - NeuMiss networks: differentiable programming for supervised learning with missing values A Proofs

Neural Information Processing Systems

Proof of Lemma 2. Identifying the second- and first-order terms in X, we get: The last equality allows us to conclude the proof. Additionally, assume that either Assumption 2 or Assumption 3 holds. This concludes the proof according to Lemma 1. Here we establish an auxiliary result, controlling the convergence of Neumann iterates to the matrix inverse. Note that Proposition A.1 can easily be extended to the general case by working with M (61), i.e., when a nonlinearity is applied to the activations.
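The auxiliary convergence result can be illustrated numerically (a minimal sketch with our own variable names, not code from the paper): when the spectral radius of I - M is strictly below one, the truncated Neumann series sum of (I - M)^j converges to the inverse of M.

```python
import numpy as np

def neumann_inverse(M, n_iter):
    """Approximate M^{-1} by the truncated Neumann series
    sum_{j=0}^{n_iter} (I - M)^j, valid when the spectral radius
    of (I - M) is strictly below 1."""
    I = np.eye(M.shape[0])
    R = I - M
    S = I.copy()       # partial sum, starts at the j = 0 term
    term = I.copy()    # current power of R
    for _ in range(n_iter):
        term = term @ R
        S = S + term
    return S
```

The approximation error decays geometrically in the number of iterates, which is what makes these iterates a natural building block for the unrolled NeuMiss architecture.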



On Learning Fairness and Accuracy on Multiple Subgroups

Neural Information Processing Systems

In the upper level, the fair predictor is updated to stay close to all subgroup-specific predictors. We further prove that such a bilevel objective can effectively control the group sufficiency and generalization error. We evaluate the proposed framework on real-world datasets.




Review for NeurIPS paper: NeuMiss networks: differentiable programming for supervised learning with missing values.

Neural Information Processing Systems

The paper attacks the classical problem of linear regression with missing values. It computes the Bayes predictor in several missing-value settings and then uses a Neumann series to approximate it. This approximation is then used to design neural networks with ReLU activations. The propositions describing self-masking missingness, which appears to be a novel concept, are interesting but can be considered slightly restrictive because of the linear Gaussian assumptions. However, both the results and the methods should be of interest to the NeurIPS 2020 community.