AITopics | Bayesian Inference

Collaborating Authors

Bayesian Inference

Bayes' Theorem allows a program to infer the probabilities of likely causes from the probabilities of their effects, when what it is given are the probabilities of effects, given the causes.

News Overviews Instructional Materials AI-Alerts Classics

A flexible Bayesian non-parametric mixture model reveals multiple dependencies of swap errors in visual working memory

Radmard, Puria, Bays, Paul M., Lengyel, Máté

arXiv.org Artificial IntelligenceMay-5-2025

Human behavioural data in psychophysics has been used to elucidate the underlying mechanisms of many cognitive processes, such as attention, sensorimotor integration, and perceptual decision making. Visual working memory has particularly benefited from this approach: analyses of VWM errors have proven crucial for understanding VWM capacity and coding schemes, in turn constraining neural models of both. One poorly understood class of VWM errors are swap errors, whereby participants recall an uncued item from memory. Swap errors could arise from erroneous memory encoding, noisy storage, or errors at retrieval time - previous research has mostly implicated the latter two. However, these studies made strong a priori assumptions on the detailed mechanisms and/or parametric form of errors contributed by these sources. Here, we pursue a data-driven approach instead, introducing a Bayesian non-parametric mixture model of swap errors (BNS) which provides a flexible descriptive model of swapping behaviour, such that swaps are allowed to depend on both the probed and reported features of every stimulus item. We fit BNS to the trial-by-trial behaviour of human participants and show that it recapitulates the strong dependence of swaps on cue similarity in multiple datasets. Critically, BNS reveals that this dependence coexists with a non-monotonic modulation in the report feature dimension for a random dot motion direction-cued, location-reported dataset. The form of the modulation inferred by BNS opens new questions about the importance of memory encoding in causing swap errors in VWM, a distinct source to the previously suggested binding and cueing errors. Our analyses, combining qualitative comparisons of the highly interpretable BNS parameter structure with rigorous quantitative model comparison and recovery methods, show that previous interpretations of swap errors may have been incomplete.

artificial intelligence, machine learning, swap error, (19 more...)

arXiv.org Artificial Intelligence

2505.01178

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area (0.48)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Add feedback

Inference for max-linear Bayesian networks with noise

Adams, Mark, Ferry, Kamillo, Yoshida, Ruriko

arXiv.org Machine LearningMay-2-2025

Max-Linear Bayesian Networks (MLBNs) provide a powerful framework for causal inference in extreme-value settings; we consider MLBNs with noise parameters with a given topology in terms of the max-plus algebra by taking its logarithm. Then, we show that an estimator of a parameter for each edge in a directed acyclic graph (DAG) is distributed normally. We end this paper with computational experiments with the expectation and maximization (EM) algorithm and quadratic optimization.

artificial intelligence, bayesian network, machine learning, (18 more...)

arXiv.org Machine Learning

2505.00229

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Rhode Island > Providence County > Providence (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.87)

Add feedback

A Study on Group Decision Making Problem Based on Fuzzy Reasoning and Bayesian Networks

Rong, Shui-jin, Guo, Wei, Zhang, Da-qing

arXiv.org Artificial IntelligenceMay-1-2025

Aiming at the group decision - making problem with multi - objective attributes, this study proposes a group decision - making system that integrates fuzzy inference and Bayesian network. A fuzzy rule base is constructed by combining threshold values, membership functions, expert experience, and domain knowledge to address quantitative challenges such as scale differences and expert linguistic variables. A hierarchical Bayesian network is designed, featuring a directed acyclic graph with nodes selected by experts, and maximum likelihood estimation is used to dynamically optimize the conditional probability table, modeling the nonlinear correlations among multidimensional indices for posterior probability aggregation. In a comprehensive student evaluation case, this method is compared with the traditional weighted scoring approach. The results indicate that the proposed method demonstrates effectiveness in both rule criterion construction and ranking consistency, with a classification accuracy of 86.0% and an F1 value improvement of 53.4% over the traditional method. Additionally, computational experiments on real - world datasets across various group decision scenarios assess the method's performance and robustness, providing evidence of its reliability in diverse contexts.

artificial intelligence, bayesian network, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2504.21568

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Diagnostic Medicine (0.94)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.70)
Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges

Xiao, Xiao, Su, Yu, Zhang, Sijing, Chen, Zhang, Chen, Yadong, Liu, Tian

arXiv.org Artificial IntelligenceMay-1-2025

Large language models (LLMs) exhibit probabilistic output characteristics, yet conventional evaluation frameworks rely on deterministic scalar metrics. This study introduces a Bayesian approach for LLM capability assessment that integrates prior knowledge through probabilistic inference, addressing limitations under limited-sample regimes. By treating model capabilities as latent variables and leveraging a curated query set to induce discriminative responses, we formalize model ranking as a Bayesian hypothesis testing problem over mutually exclusive capability intervals. Experimental evaluations with GPT-series models demonstrate that the proposed method achieves superior discrimination compared to conventional evaluation methods. Results indicate that even with reduced sample sizes, the approach maintains statistical robustness while providing actionable insights, such as probabilistic statements about a model's likelihood of surpassing specific baselines. This work advances LLM evaluation methodologies by bridging Bayesian inference with practical constraints in real-world deployment scenarios.

large language model, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2504.21303

Country:

Asia > India (1.00)
North America > United States (0.67)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks > Manufacturer (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Power Flow Approximations for Multiphase Distribution Networks using Gaussian Processes

Glover, Daniel, Pareek, Parikshit, Deka, Deepjyoti, Dubey, Anamika

arXiv.org Artificial IntelligenceMay-1-2025

Learning-based approaches are increasingly leveraged to manage and coordinate the operation of grid-edge resources in active power distribution networks. Among these, model-based techniques stand out for their superior data efficiency and robustness compared to model-free methods. However, effective model learning requires a learning-based approximator for the underlying power flow model. This study extends existing work by introducing a data-driven power flow method based on Gaussian Processes (GPs) to approximate the multiphase power flow model, by mapping net load injections to nodal voltages. Simulation results using the IEEE 123-bus and 8500-node distribution test feeders demonstrate that the trained GP model can reliably predict the nonlinear power flow solutions with minimal training data. We also conduct a comparative analysis of the training efficiency and testing performance of the proposed GP-based power flow approximator against a deep neural network-based approximator, highlighting the advantages of our data-efficient approach. Results over realistic operating conditions show that despite an 85% reduction in the training sample size (corresponding to a 92.8% improvement in training time), GP models produce a 99.9% relative reduction in mean absolute error compared to the baselines of deep neural networks.

artificial intelligence, machine learning, power flow, (19 more...)

arXiv.org Artificial Intelligence

2504.2126

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Sparse mixed linear modeling with anchor-based guidance for high-entropy alloy discovery

Murakami, Ryo, Miura, Seiji, Endo, Akihiro, Minamoto, Satoshi

arXiv.org Machine LearningApr-30-2025

REGULAR ARTICLE Sparse mixed linear modeling with anchor-based guidance for high-entropy alloy discovery Ryo Murakami a, Seiji Miura b, Akihiro Endo a and Satoshi Minamoto a a Materials Data Platform, Research Network and Facility Services Division, National Institute for Materials Science, Tsukuba 305-0044, Ibaraki, Japan b Division of Materials Science and Engineering, Faculty of Engineering, Hokkaido University, Sapporo 060-8628, Hokkaido, Japan ARTICLE HISTORY Compiled April 30, 2025 ABSTRACT High-entropy alloys have attracted attention for their exceptional mechanical properties and thermal stability. To solve this problem, machine learning techniques have been increasingly employed for property prediction and high-throughput screening. Nevertheless, highly accurate nonlinear models often suffer from a lack of interpretability, which is a major limitation. In this study, we focus on local data structures that emerge from the greedy search behavior inherent to experimental data acquisition. By introducing a linear and low-dimensional mixture regression model, we strike a balance between predictive performance and model interpretability. In addition, we develop an algorithm that simultaneously performs prediction and feature selection by considering multiple candidate descriptors. Through a case study on high-entropy alloys, this study introduces a method that combines anchor-guided clustering and sparse linear modeling to address biased data structures arising from greedy exploration in materials science. KEYWORDS Sparse modeling; Mixed linear model; Bayesian inference; Materials informatics; Data-driven science; High-entropy alloys 1. Introduction In recent years, high-entropy alloys (HEAs) have garnered attention as next-generation materials for their outstanding mechanical properties, thermal stability, and corrosion resistance [1,2]. Unlike conventional alloy designs, HEAs--also referred to as multi-principal element alloys--comprise multiple (typically five or more) principal elements, offering a high degree of chemical and structural freedom. This unique composition enables the exploration of novel properties unattainable in traditional materials systems.

alloy, artificial intelligence, machine learning, (20 more...)

arXiv.org Machine Learning

2504.20354

Country:

Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.24)
Asia > Japan > Hokkaidō > Hokkaidō Prefecture > Sapporo (0.24)

Genre: Research Report > New Finding (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Transforming Evidence Synthesis: A Systematic Review of the Evolution of Automated Meta-Analysis in the Age of AI

Li, Lingbo, Mathrani, Anuradha, Susnjak, Teo

arXiv.org Artificial IntelligenceApr-30-2025

Exponential growth in scientific literature has heightened the demand for efficient evidence-based synthesis, driving the rise of the field of Automated Meta-analysis (AMA) powered by natural language processing and machine learning. This PRISMA systematic review introduces a structured framework for assessing the current state of AMA, based on screening 978 papers from 2006 to 2024, and analyzing 54 studies across diverse domains. Findings reveal a predominant focus on automating data processing (57%), such as extraction and statistical modeling, while only 17% address advanced synthesis stages. Just one study (2%) explored preliminary full-process automation, highlighting a critical gap that limits AMA's capacity for comprehensive synthesis. Despite recent breakthroughs in large language models (LLMs) and advanced AI, their integration into statistical modeling and higher-order synthesis, such as heterogeneity assessment and bias evaluation, remains underdeveloped. This has constrained AMA's potential for fully autonomous meta-analysis. From our dataset spanning medical (67%) and non-medical (33%) applications, we found that AMA has exhibited distinct implementation patterns and varying degrees of effectiveness in actually improving efficiency, scalability, and reproducibility. While automation has enhanced specific meta-analytic tasks, achieving seamless, end-to-end automation remains an open challenge. As AI systems advance in reasoning and contextual understanding, addressing these gaps is now imperative. Future efforts must focus on bridging automation across all meta-analysis stages, refining interpretability, and ensuring methodological robustness to fully realize AMA's potential for scalable, domain-agnostic synthesis.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2504.20113

Country: North America > United States (0.92)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)
Research Report > New Finding (0.70)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

Socially-Aware Autonomous Driving: Inferring Yielding Intentions for Safer Interactions

Wang, Jing, Jin, Yan, Taghavifar, Hamid, Ding, Fei, Wei, Chongfeng

arXiv.org Artificial IntelligenceApr-29-2025

--Since the emergence of autonomous driving technology, it has advanced rapidly over the past decade. It is becoming increasingly likely that autonomous vehicles (A Vs) would soon coexist with human-driven vehicles (HVs) on the roads. Currently, safety and reliable decision-making remain significant challenges, particularly when A Vs are navigating lane changes and interacting with surrounding HVs. Therefore, precise estimation of the intentions of surrounding HVs can assist A Vs in making more reliable and safe lane change decision-making. This involves not only understanding their current behaviors but also predicting their future motions without any direct communication. However, distinguishing between the passing and yielding intentions of surrounding HVs still remains ambiguous. T o address the challenge, we propose a social intention estimation algorithm rooted in Directed Acyclic Graph (DAG), coupled with a decision-making framework employing Deep Reinforcement Learning (DRL) algorithms. T o evaluate the method's performance, the proposed framework can be tested and applied in a lane-changing scenario within a simulated environment. Furthermore, the experiment results demonstrate how our approach enhances the ability of A Vs to navigate lane changes safely and efficiently on roads. UTONOMOUS driving decision-making is a critical component of autonomous driving systems, aiming to make reasonable and safe driving decisions based on environmental perception [1]. The decision-making process not only needs to consider the kinematic and dynamic constraints of the vehicle but also needs to comply with traffic rules, evaluate potential risks, and coexist safely with other traffic participants in complex driving scenarios, such as executing lane changes on highways and navigating intersections, as illustrated in Figure 1. Executing lane changes on the highway remains a formidable challenge for A Vs in the real world, primarily due to environmental complexity and uncertainty. Jing Wang, Y an Jin are with the School of Mechanical and Aerospace Engineering, Queen's University Belfast, Belfast, United Kingdom (email: jwang61@qub.ac.uk, y.jin@qub.ac.uk)

artificial intelligence, machine learning, reinforcement learning, (22 more...)

arXiv.org Artificial Intelligence

2504.20004

Country:

Europe > United Kingdom > Northern Ireland > County Down > Belfast (0.24)
Europe > United Kingdom > Northern Ireland > County Antrim > Belfast (0.24)

Genre: Research Report > New Finding (0.66)

Industry:

Transportation > Ground > Road (1.00)
Information Technology (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Probabilistic and Causal Satisfiability: Constraining the Model

Bläser, Markus, Dörfler, Julian, Liśkiewicz, Maciej, van der Zander, Benito

arXiv.org Artificial IntelligenceApr-29-2025

We study the complexity of satisfiability problems in probabilistic and causal reasoning. Given random variables $X_1, X_2,\ldots$ over finite domains, the basic terms are probabilities of propositional formulas over atomic events $X_i = x_i$, such as $P(X_1 = x_1)$ or $P(X_1 = x_1 \vee X_2 = x_2)$. The basic terms can be combined using addition (yielding linear terms) or multiplication (polynomial terms). The probabilistic satisfiability problem asks whether a joint probability distribution satisfies a Boolean combination of (in)equalities over such terms. Fagin et al. (1990) showed that for basic and linear terms, this problem is NP-complete, making it no harder than Boolean satisfiability, while Mossé et al. (2022) proved that for polynomial terms, it is complete for the existential theory of the reals. Pearl's Causal Hierarchy (PCH) extends the probabilistic setting with interventional and counterfactual reasoning, enriching the expressiveness of languages. However, Mossé et al. (2022) found that satisfiability complexity remains unchanged. Van der Zander et al. (2023) showed that introducing a marginalization operator to languages induces a significant increase in complexity. We extend this line of work by adding two new dimensions to the problem by constraining the models. First, we fix the graph structure of the underlying structural causal model, motivated by settings like Pearl's do-calculus, and give a nearly complete landscape across different arithmetics and PCH levels. Second, we study small models. While earlier work showed that satisfiable instances admit polynomial-size models, this is no longer guaranteed with compact marginalization. We characterize the complexities of satisfiability under small-model constraints across different settings.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2504.19944

Country:

Europe > Germany (0.28)
North America > United States (0.28)

Genre:

Research Report > Strength High (0.46)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.94)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)

Add feedback

Observational Learning with a Budget

Wu, Shuo, Poojary, Pawan, Berry, Randall

arXiv.org Artificial IntelligenceApr-29-2025

--We consider a model of Bayesian observational learning in which a sequence of agents receives a private signal about an underlying binary state of the world. Each agent makes a decision based on its own signal and its observations of previous agents. A central planner seeks to improve the accuracy of these signals by allocating a limited budget to enhance signal quality across agents. We formulate and analyze the budget allocation problem and propose two optimal allocation strategies. At least one of these strategies is shown to maximize the probability of achieving a correct information cascade. I NTRODUCTION Consider that an item, which could either be of a "good" or a "bad" quality, is up for sale in a market where agents arrive sequentially and decide whether to buy the item, with their choice serving as a recommendation for later agents. While the quality of the item is unknown to the agents, every agent has its own prior knowledge of the item's quality in the form of its private belief. Each agent then makes a payoff optimal decision based on its own prior knowledge and by observing the choices of its predecessors. Such models of "observational learning" were first studied by [1]-[3] under a Bayesian learning framework wherein each agent's prior knowledge is in the form of a privately observed signal about the pay-off-relevant state of the world, which in this case is the item's quality, and is generated from a commonly known probability distribution. A salient feature of such models is the emergence of information cascades or herding, i.e., at some point, it is optimal for an agent to ignore its own private signal and follow the actions of the past agents. Subsequent agents then follow suit due to their homogeneity.

artificial intelligence, bayesian inference, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2504.19396

Country: North America > United States > Illinois (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Add feedback