Recover Experimental Data with Selection Bias using Counterfactual Logic

Jun-5-2025–arXiv.org Artificial Intelligence

Selection bias, arising from the systematic inclusion or exclusion of certain samples, poses a significant challenge to the validity of causal inference. While Bareinboim et al. introduced methods for recovering unbiased observational and interventional distributions from biased data using partial external information, the complexity of the backdoor adjustment and the method's strong reliance on observational data limit its applicability in many practical settings. In this paper, we formally discover the recoverability of $P(Y^*_{x^*})$ under selection bias with experimental data. By explicitly constructing counterfactual worlds via Structural Causal Models (SCMs), we analyze how selection mechanisms in the observational world propagate to the counterfactual domain. We derive a complete set of graphical and theoretical criteria to determine that the experimental distribution remain unaffected by selection bias. Furthermore, we propose principled methods for leveraging partially unbiased observational data to recover $P(Y^*_{x^*})$ from biased experimental datasets. Simulation studies replicating realistic research scenarios demonstrate the practical utility of our approach, offering concrete guidance for mitigating selection bias in applied causal inference.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jun-5-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report
  - New Finding (0.93)
  - Experimental Study (0.93)
  - Strength High (0.67)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:
- Information Technology
  - Enterprise Applications > Customer Relationship Management (1.00)
  - Artificial Intelligence
    - Natural Language (0.93)
    - Representation & Reasoning > Uncertainty (0.93)
    - Machine Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found