Bivariate Causal Discovery for Categorical Data via Classification with Optimal Label Permutation
–arXiv.org Artificial Intelligence
Causal discovery for quantitative data has been extensively studied but less is known for categorical data. We propose a novel causal model for categorical data based on a new classification model, termed classification with optimal label permutation (COLP). By design, COLP is a parsimonious classifier, which gives rise to a provably identifiable causal model. A simple learning algorithm via comparing likelihood functions of causal and anti-causal models suffices to learn the causal direction. Through experiments with synthetic and real data, we demonstrate the favorable performance of the proposed COLP-based causal model compared to state-of-the-art methods. We also make available an accompanying R package COLP, which contains the proposed causal discovery algorithm and a benchmark dataset of categorical cause-effect pairs.
arXiv.org Artificial Intelligence
Dec-10-2022
- Country:
- Indian Ocean > Bass Strait (0.04)
- Oceania > Australia
- Tasmania (0.04)
- North America > United States
- Virginia > Arlington County
- Arlington (0.04)
- Texas > Brazos County
- College Station (0.14)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Virginia > Arlington County
- Europe > Germany
- Baden-Württemberg > Tübingen Region > Tübingen (0.05)
- Genre:
- Research Report (1.00)