Telling Cause from Effect using MDL-based Local and Global Regression
Marx, Alexander, Vreeken, Jilles
Telling cause from effect from observational data is one of the fundamental problems in science [26], [18]. We consider the problem of inferring the most likely direction between two univariate numeric random variables X and Y. That is, we are interested in identifying whether X causes Y, whether Y causes X, or whether they are merely correlated. Traditional methods, that rely on conditional independence tests, cannot decide between the Markov equivalent classes of X Y and Y X [18]. Recently, it has been postulated however that if X Y, there exists an independence between the marginal distribution of the cause, P (X), and the conditional distribution of the effect given the cause, P (Y X) [25], [9]. The state of the art exploits this asymmetry in various ways, and overall obtain up to 70% accuracy on a well-known benchmark of cause-effect pairs [24], [8], [20], [10], [17]. In this paper we break this barrier, and give an elegant score that is computable in linear-time and obtains over 82% accuracy on the same benchmark.
Sep-26-2017
- Country:
- North America > United States
- New York > New York County > New York City (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany
- Saarland > Saarbrücken (0.04)
- Baden-Württemberg > Tübingen Region
- Tübingen (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (1.00)