Learning Neural Causal Models with Active Interventions

Scherrer, Nino, Bilaniuk, Olexa, Annadani, Yashas, Goyal, Anirudh, Schwab, Patrick, Schölkopf, Bernhard, Mozer, Michael C., Bengio, Yoshua, Bauer, Stefan, Ke, Nan Rosemary

Sep-6-2021–arXiv.org Machine Learning

Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science. The appealing scaling properties of neural networks have recently led to a surge of interest in differentiable neural network-based methods for learning causal structures from data. So far differentiable causal discovery has focused on static datasets of observational or interventional origin. In this work, we introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process. Our method significantly reduces the required number of interactions compared with random intervention targeting and is applicable for both discrete and continuous optimization formulations of learning the underlying directed acyclic graph (DAG) from data. We examine the proposed method across a wide range of settings and demonstrate superior performance on multiple benchmarks from simulated to real-world data. Learning causal structure from data is a challenging but important task that lies at the heart of scientific reasoning and accompanying progress in many disciplines (Sachs et al., 2005; Hill et al., 2016; Lauritzen & Spiegelhalter, 1988; Korb & Nicholson, 2010). While there exists a plethora of methods for the task, computationally and statistically more efficient algorithms are highly desired (Heinze-Deml et al., 2018). As a result, there has been a surge in interest in differentiable structure learning and the combination of deep learning and causal inference (Schölkopf et al., 2021). However, the improvement critically depends on the experiments and interventions available. Despite advances in high-throughput methods for interventional data in specific fields (Dixit et al., 2016), the acquisition of interventional samples in the general settings tends to be costly, technically impossible or even unethical for specific interventions.

graph, neural network, optimization problem, (21 more...)

arXiv.org Machine Learning

Sep-6-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (1.00)
    - Neural Networks (1.00)
  - Representation & Reasoning
    - Optimization (1.00)
    - Uncertainty > Bayesian Inference (1.00)