A theoretical study of Y structures for causal discovery
Subramani Mani, Peter L. Spirtes, Gregory F. Cooper
arXiv.org Artificial Intelligence
Several existing algorithms can, under appropriate assumptions, reliably identify a subset of the underlying causal relationships from observational data. This paper introduces the first computationally feasible score-based algorithm that can reliably identify causal relationships in the large sample limit for discrete models, while allowing for the possibility that there are unobserved common causes. In doing so, the algorithm never needs to assign scores to causal structures with unobserved common causes. The algorithm is based on the identification of so-called Y substructures within Bayesian network structures that can be learned from observational data. An example of a Y substructure is A -> C, B -> C, C -> D. After providing background on causal discovery, the paper proves the conditions under which the algorithm is reliable in the large sample limit.
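To make the Y substructure concrete, the following is a minimal Python sketch that searches a directed graph, given as a list of edges, for the pattern A -> C, B -> C, C -> D. It is not the paper's score-based algorithm, which learns the network from data. The function name `find_y_structures` is hypothetical, and the extra conditions it checks (A and B are not adjacent to each other, and neither is adjacent to D) are an assumption about the definition that goes beyond the single example in the abstract.

```python
from itertools import combinations


def find_y_structures(edges):
    """Return (a, b, c, d) tuples such that a -> c, b -> c, c -> d.

    Assumed definition (not stated in the abstract): a and b are not
    adjacent to each other, and neither a nor b is adjacent to d.
    """
    edges = set(edges)
    parents, children = {}, {}
    for u, v in edges:
        parents.setdefault(v, set()).add(u)
        children.setdefault(u, set()).add(v)

    def adjacent(x, y):
        # Two nodes are adjacent if an edge joins them in either direction.
        return (x, y) in edges or (y, x) in edges

    found = []
    for c in sorted(parents):
        # Every unordered pair of parents of c is a candidate (a, b).
        for a, b in combinations(sorted(parents[c]), 2):
            if adjacent(a, b):
                continue
            for d in sorted(children.get(c, ())):
                if d in (a, b):
                    continue
                if adjacent(a, d) or adjacent(b, d):
                    continue
                found.append((a, b, c, d))
    return found


# The example from the abstract: A -> C, B -> C, C -> D.
example = [("A", "C"), ("B", "C"), ("C", "D")]
y_structures = find_y_structures(example)
print(y_structures)  # [('A', 'B', 'C', 'D')]
```

Adding an edge such as A -> B or A -> D to the example makes the function return an empty list, because the assumed non-adjacency conditions no longer hold.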
Jun-27-2012