Federated Causal Inference in Healthcare: Methods, Challenges, and Applications
Li, Haoyang, Xu, Jie, Gan, Kyra, Wang, Fei, Zang, Chengxi
arXiv.org Artificial Intelligence
Federated causal inference enables multi-site treatment effect estimation without sharing individual-level data, offering a privacy-preserving solution for real-world evidence generation. However, data heterogeneity across sites, manifested as differences in covariate, treatment, and outcome distributions, poses significant challenges for unbiased and efficient estimation. In this paper, we present a comprehensive review and theoretical analysis of federated causal effect estimation for binary, continuous, and time-to-event outcomes. We classify existing methods into weight-based strategies and optimization-based frameworks, and we discuss extensions including personalized models, peer-to-peer communication, and model decomposition. For time-to-event outcomes, we examine federated Cox and Aalen-Johansen models and derive their asymptotic bias and variance under heterogeneity. Our analysis reveals that FedProx-style regularization achieves a near-optimal bias-variance trade-off relative to naive averaging and meta-analysis. We review related software tools and conclude by outlining opportunities, challenges, and future directions for scalable, fair, and trustworthy federated causal inference in distributed healthcare systems.
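To make the baselines named in the abstract concrete, the following is a minimal sketch of how site-level treatment effect estimates might be pooled without exchanging patient-level rows. It is not the paper's method: the `SiteSummary` format, the function names, and the three example sites are hypothetical and chosen only for illustration. It contrasts naive averaging with fixed-effect (inverse-variance) meta-analysis.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class SiteSummary:
    """Aggregate statistics a site shares instead of patient-level data.

    Hypothetical format, assumed here for illustration only.
    """
    n: int            # number of patients at the site
    effect: float     # site-level treatment effect estimate
    variance: float   # estimated variance of that estimate


def naive_average(sites):
    """Unweighted mean of the site-level estimates."""
    return sum(s.effect for s in sites) / len(sites)


def inverse_variance_meta(sites):
    """Fixed-effect meta-analysis: weight each site by 1 / variance.

    Returns the pooled estimate and its standard error.
    """
    weights = [1.0 / s.variance for s in sites]
    total = sum(weights)
    pooled = sum(w * s.effect for w, s in zip(weights, sites)) / total
    return pooled, sqrt(1.0 / total)


# Made-up summaries for three sites of very different sizes.
sites = [
    SiteSummary(n=1000, effect=0.10, variance=0.01),
    SiteSummary(n=200, effect=0.30, variance=0.04),
    SiteSummary(n=50, effect=0.50, variance=0.25),
]

if __name__ == "__main__":
    pooled, se = inverse_variance_meta(sites)
    print(f"naive average:    {naive_average(sites):.3f}")
    print(f"inverse-variance: {pooled:.3f} (SE {se:.3f})")
```

In this toy setting the naive average is pulled toward the small, noisy site, while the inverse-variance estimate stays close to the large site. Neither approach corrects for heterogeneity in covariate or treatment distributions across sites, which is the gap the weight-based and optimization-based methods in the review aim to address.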
May-6-2025
- Country:
- North America > United States (0.93)
- Genre:
- Overview (1.00)
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Consumer Health (0.93)
- Health Care Technology (0.68)
- Therapeutic Area
- Oncology (1.00)
- Infections and Infectious Diseases (1.00)
- Immunology (1.00)
- Pulmonary/Respiratory Diseases (0.94)
- Psychiatry/Psychology (0.93)
- Neurology > Alzheimer's Disease (0.46)
- Technology:
- Information Technology
- Software (1.00)
- Security & Privacy (1.00)
- Data Science > Data Mining (1.00)
- Communications (0.93)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Machine Learning > Statistical Learning (1.00)