AITopics | causal structure learning

A Recursive Decomposition Framework for Causal Structure Learning in the Presence of Latent Variables

Li, Zheng, Xie, Feng, Nie, Shenglan, Guo, Xichen, Wang, Ruxin, Zhang, Hao

arXiv.org Machine Learning, May-12-2026

Constraint-based causal discovery is widely used for learning causal structures, but heavy reliance on conditional independence (CI) testing makes it computationally expensive in high-dimensional settings. To mitigate this limitation, many divide-and-conquer frameworks have been proposed, but most assume causal sufficiency, i.e., no latent variables. In this paper, we show that divide-and-conquer strategies can be theoretically generalized beyond causal sufficiency to settings with latent variables. Specifically, we propose a recursive decomposition framework, termed DiCoLa, that enables divide-and-conquer causal discovery in the presence of latent variables. It recursively decomposes the global learning task into smaller subproblems and integrates their solutions through a principled reconstruction step to recover the global structure. We theoretically establish the soundness and completeness of the proposed framework. Extensive experiments on synthetic data demonstrate that our approach significantly improves computational efficiency across a range of causal discovery algorithms, while experiments on a real-world dataset further illustrate its practical effectiveness.

artificial intelligence, graph, machine learning, (13 more...)

arXiv.org Machine Learning

2605.10651

Country:

North America > United States (0.67)
Asia > China (0.46)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
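
The abstract of the recursive decomposition paper above attributes the cost of constraint-based discovery to conditional independence (CI) testing. As a rough illustration of what a single CI test involves, here is a minimal sketch of a Fisher z test on a partial correlation with one conditioning variable. It is not the DiCoLa procedure or any test named in the paper; the function names, the linear-Gaussian toy data, and the chain X -> Z -> Y are assumptions made for this example.

```python
import math
import random


def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """Correlation of x and y after linearly adjusting for a single variable z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


def fisher_z_pvalue(r, n, k):
    """Two-sided p-value for H0: correlation is zero.

    n is the sample size and k the number of conditioning variables;
    requires n - k - 3 > 0.
    """
    r = max(min(r, 0.999999), -0.999999)  # keep the log finite
    z = 0.5 * math.log((1 + r) / (1 - r))
    stat = math.sqrt(n - k - 3) * abs(z)
    return math.erfc(stat / math.sqrt(2))


def simulate_chain(n=2000, seed=0):
    """Toy linear-Gaussian chain X -> Z -> Y (assumed data, for illustration only)."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    z = [a + rng.gauss(0, 1) for a in x]
    y = [b + rng.gauss(0, 1) for b in z]
    return x, z, y


x, z, y = simulate_chain()
n = len(x)
p_marginal = fisher_z_pvalue(pearson(x, y), n, k=0)             # X vs Y, no conditioning
p_given_z = fisher_z_pvalue(partial_corr(x, y, z), n, k=1)      # X vs Y given Z
print("p(X indep Y)     =", p_marginal)
print("p(X indep Y | Z) =", p_given_z)
```

In the chain, X and Y are marginally dependent but independent given Z, so the first p-value should be very small and the second should not be. A constraint-based algorithm runs many such tests over growing conditioning sets, which is the cost that divide-and-conquer schemes aim to reduce.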


Multi-domain Causal Structure Learning in Linear Systems

AmirEmad Ghassami, Negar Kiyavash, Biwei Huang, Kun Zhang

Neural Information Processing Systems, Feb-13-2026, 03:00:43 GMT

Neural Information Processing Systems http://nips.cc/

causal direction, causal order, sparse, (11 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
(5 more...)

Industry:

Government > Regional Government > North America Government > United States Government (0.46)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)


Amortized Inference for Causal Structure Learning

Neural Information Processing Systems, Dec-24-2025, 05:37:29 GMT

Inferring causal structure poses a combinatorial search problem that typically involves evaluating structures with a score or independence test. The resulting search is costly, and designing suitable scores or tests that capture prior knowledge is difficult. In this work, we propose to amortize causal structure learning. Rather than searching over structures, we train a variational inference model to directly predict the causal structure from observational or interventional data. This allows our inference model to acquire domain-specific inductive biases for causal discovery solely from data generated by a simulator, bypassing both the hand-engineering of suitable score functions and the search over graphs. The architecture of our inference model emulates permutation invariances that are crucial for statistical efficiency in structure learning, which facilitates generalization to significantly larger problem instances than seen during training. On synthetic data and semisynthetic gene expression data, our models exhibit robust generalization capabilities when subject to substantial distribution shifts and significantly outperform existing algorithms, especially in the challenging genomics domain. Our code and models are publicly available at: https://github.com/larslorch/avici

amortized inference, causal structure learning, name change, (3 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.67)


Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning

Neural Information Processing Systems, Dec-23-2025, 17:31:52 GMT

Causal structure learning is a key problem in many domains. Causal structures can be learnt by performing experiments on the system of interest. We address the largely unexplored problem of designing a batch of experiments that each simultaneously intervene on multiple variables. While potentially more informative than the commonly considered single-variable interventions, selecting such interventions is algorithmically much more challenging, due to the doubly-exponential combinatorial search space over sets of composite interventions. In this paper, we develop efficient algorithms for optimizing different objective functions quantifying the informativeness of a budget-constrained batch of experiments. By establishing novel submodularity properties of these objectives, we provide approximation guarantees for our algorithms. Empirically, our algorithms outperform both random interventions and algorithms that only select single-variable interventions.

causal structure learning, intervention, near-optimal multi-perturbation experimental design, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)
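
The abstract of the experimental design paper above ties its approximation guarantees to submodularity. To show why that property matters, the sketch below runs the classic greedy rule for maximizing a monotone submodular function under a cardinality budget, which is known to reach at least a (1 - 1/e) fraction of the optimum. It is not the paper's algorithm or objective: the coverage objective, the intervention names, and the idea that each intervention "resolves" a fixed set of edge ids are hypothetical stand-ins chosen for illustration.

```python
def coverage(selected, candidates):
    """Toy submodular objective: number of distinct items covered by the selection."""
    covered = set()
    for name in selected:
        covered |= candidates[name]
    return len(covered)


def greedy_select(candidates, budget):
    """Pick up to `budget` candidates, each time adding the largest marginal gain."""
    chosen = []
    for _ in range(budget):
        base = coverage(chosen, candidates)
        best, best_gain = None, 0
        for name in sorted(candidates):  # sorted for deterministic tie-breaking
            if name in chosen:
                continue
            gain = coverage(chosen + [name], candidates) - base
            if gain > best_gain:
                best, best_gain = name, gain
        if best is None:  # no candidate adds anything new
            break
        chosen.append(best)
    return chosen


# Hypothetical interventions mapped to the edge ids each one would resolve.
interventions = {
    "A": {1, 2, 3},
    "B": {3, 4},
    "C": {4, 5, 6, 7},
    "D": {1, 7},
}

batch = greedy_select(interventions, budget=2)
print(batch)                           # ['C', 'A']
print(coverage(batch, interventions))  # 7
```

Diminishing returns is what makes the greedy choice safe here: once "C" is chosen, "B" and "D" each add only one new item, so "A" is picked next.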


Multi-domain Causal Structure Learning in Linear Systems

AmirEmad Ghassami, Negar Kiyavash, Biwei Huang, Kun Zhang

Neural Information Processing Systems, Nov-20-2025, 17:12:15 GMT

Our approach unifies the idea in those works and generalizes to the case that there is no such invariance across the domains. Our proposed methods are generally capable of identifying causal direction from fewer than ten domains.

artificial intelligence, causal direction, machine learning, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
(5 more...)

Industry:

Government > Regional Government > North America Government > United States Government (0.46)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)


Amortized Inference for Causal Structure Learning

Neural Information Processing Systems, Oct-11-2024, 02:56:22 GMT

Inferring causal structure poses a combinatorial search problem that typically involves evaluating structures with a score or independence test. The resulting search is costly, and designing suitable scores or tests that capture prior knowledge is difficult. In this work, we propose to amortize causal structure learning. Rather than searching over structures, we train a variational inference model to directly predict the causal structure from observational or interventional data. This allows our inference model to acquire domain-specific inductive biases for causal discovery solely from data generated by a simulator, bypassing both the hand-engineering of suitable score functions and the search over graphs. The architecture of our inference model emulates permutation invariances that are crucial for statistical efficiency in structure learning, which facilitates generalization to significantly larger problem instances than seen during training.

amortized inference, causal structure learning, inference model

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.90)


Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning

Neural Information Processing Systems, Oct-9-2024, 10:16:41 GMT

Causal structure learning is a key problem in many domains. Causal structures can be learnt by performing experiments on the system of interest. We address the largely unexplored problem of designing a batch of experiments that each simultaneously intervene on multiple variables. While potentially more informative than the commonly considered single-variable interventions, selecting such interventions is algorithmically much more challenging, due to the doubly-exponential combinatorial search space over sets of composite interventions. In this paper, we develop efficient algorithms for optimizing different objective functions quantifying the informativeness of a budget-constrained batch of experiments.

causal structure learning, intervention, near-optimal multi-perturbation experimental design, (3 more...)

Neural Information Processing Systems

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.67)


Applying Large Language Models for Causal Structure Learning in Non Small Cell Lung Cancer

Naik, Narmada, Khandelwal, Ayush, Joshi, Mohit, Atre, Madhusudan, Wright, Hollis, Kannan, Kavya, Hill, Scott, Mamidipudi, Giridhar, Srinivasa, Ganapati, Bifulco, Carlo, Piening, Brian, Matlock, Kevin

arXiv.org Artificial Intelligence, Nov-13-2023

Causal discovery is becoming a key part of medical AI research. These methods can enhance healthcare by identifying causal links between biomarkers, demographics, treatments, and outcomes. They can aid medical professionals in choosing more impactful treatments and strategies. In parallel, Large Language Models (LLMs) have shown great potential in identifying patterns and generating insights from text data. In this paper, we investigate applying LLMs to the problem of determining the directionality of edges in causal discovery. Specifically, we test our approach on a deidentified set of Non Small Cell Lung Cancer (NSCLC) patients that have both electronic health record and genomic panel data. Graphs are validated with Bayesian Dirichlet estimators on tabular data. Our results show that LLMs can accurately predict the directionality of edges in causal graphs, outperforming existing state-of-the-art methods. These findings suggest that LLMs can play a significant role in advancing causal discovery and help us better understand complex systems.

causal structure learning, dag, language model, (10 more...)

arXiv.org Artificial Intelligence

2311.07191

Country:

South America > Chile (0.04)
North America > United States > Oregon > Washington County > Beaverton (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)


Causal structure learning with momentum: Sampling distributions over Markov Equivalence Classes of DAGs

Schauer, Moritz, Wienöbst, Marcel

arXiv.org Machine Learning, Oct-9-2023

In the context of inferring a Bayesian network structure (directed acyclic graph, DAG for short), we devise a non-reversible continuous time Markov chain, the "Causal Zig-Zag sampler", that targets a probability distribution over classes of observationally equivalent (Markov equivalent) DAGs. The classes are represented as completed partially directed acyclic graphs (CPDAGs). The non-reversible Markov chain relies on the operators used in Chickering's Greedy Equivalence Search (GES) and is endowed with a momentum variable, which improves mixing significantly as we show empirically. The possible target distributions include posterior distributions based on a prior over DAGs and a Markov equivalent likelihood. We offer an efficient implementation wherein we develop new algorithms for listing, counting, uniformly sampling, and applying possible moves of the GES operators, all of which significantly improve upon the state-of-the-art.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Machine Learning

2310.05655

Country:

North America > United States > California (0.28)
North America > United States > New York (0.04)
North America > Canada (0.04)
(2 more...)

Genre:

Workflow (0.46)
Research Report (0.40)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
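
The Causal Zig-Zag abstract above works with classes of Markov equivalent DAGs. As a small illustration of that notion, the sketch below checks equivalence with the standard graphical criterion: two DAGs over the same nodes are Markov equivalent exactly when they share the same skeleton and the same v-structures. This is not the sampler or any of the paper's GES-operator algorithms; the edge-set representation and function names are assumptions for this example, and both graphs are assumed to be valid DAGs over the same node set.

```python
from itertools import combinations


def skeleton(edges):
    """Undirected version of the graph: each edge as an unordered pair."""
    return {frozenset(edge) for edge in edges}


def v_structures(edges):
    """All colliders a -> c <- b where a and b are not adjacent."""
    skel = skeleton(edges)
    parents = {}
    for u, v in edges:
        parents.setdefault(v, set()).add(u)
    found = set()
    for child, pa in parents.items():
        for a, b in combinations(sorted(pa), 2):
            if frozenset((a, b)) not in skel:
                found.add((a, child, b))
    return found


def markov_equivalent(edges1, edges2):
    """Same skeleton and same v-structures (assumes both inputs are DAGs)."""
    return (skeleton(edges1) == skeleton(edges2)
            and v_structures(edges1) == v_structures(edges2))


chain = {("X", "Y"), ("Y", "Z")}      # X -> Y -> Z
reversed_chain = {("Z", "Y"), ("Y", "X")}  # Z -> Y -> X
collider = {("X", "Y"), ("Z", "Y")}   # X -> Y <- Z

print(markov_equivalent(chain, reversed_chain))  # True
print(markov_equivalent(chain, collider))        # False
```

The two chains encode the same conditional independences and so fall in one class, represented by a single CPDAG, while the collider forms a class of its own.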


Causal Structure Learning by Using Intersection of Markov Blankets

Dong, Yiran, Gao, Chuanhou

arXiv.org Artificial Intelligence, Jul-1-2023

In this paper, we introduce a novel causal structure learning algorithm called Endogenous and Exogenous Markov Blankets Intersection (EEMBI), which combines the properties of Bayesian networks and Structural Causal Models (SCM). Exogenous variables are special variables that are applied in SCM. We find that exogenous variables have some special characteristics and that these characteristics are still useful under the property of the Bayesian network. EEMBI intersects the Markov blankets of exogenous variables and the Markov blankets of endogenous variables, i.e., the original variables, to remove the irrelevant connections and find the true causal structure theoretically. Furthermore, we propose an extended version of EEMBI, namely EEMBI-PC, which integrates the last step of the PC algorithm into EEMBI. This modification enhances the algorithm's performance by leveraging the strengths of both approaches. Extensive experiments show that EEMBI and EEMBI-PC achieve state-of-the-art performance on both discrete and continuous datasets.

artificial intelligence, bayesian inference, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2307.00227

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Zhejiang Province > Hangzhou (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report (0.64)
Workflow (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
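
The EEMBI abstract above builds on Markov blankets. For readers unfamiliar with the term, the sketch below reads a node's Markov blanket off a known DAG: its parents, its children, and the other parents of its children. This only illustrates the definition. EEMBI estimates blankets from data and intersects them, which is not reproduced here; the example graph and the function name are assumptions.

```python
def markov_blanket(edges, node):
    """Parents, children, and co-parents (spouses) of `node` in a DAG.

    `edges` is an iterable of (parent, child) pairs.
    """
    edges = list(edges)
    parents = {u for u, v in edges if v == node}
    children = {v for u, v in edges if u == node}
    spouses = {u for u, v in edges if v in children and u != node}
    return parents | children | spouses


# Hypothetical DAG: A -> T -> C <- S, and C -> D
dag = [("A", "T"), ("T", "C"), ("S", "C"), ("C", "D")]

print(sorted(markov_blanket(dag, "T")))  # ['A', 'C', 'S']
print(sorted(markov_blanket(dag, "D")))  # ['C']
```

Given its blanket, a node is conditionally independent of every other variable in the graph, which is why blanket-based methods can restrict the search for a node's neighbours to that set.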
