Domain Generalization and Adaptation in Intensive Care with Anchor Regression
Londschien, Malte, Burger, Manuel, Rätsch, Gunnar, Bühlmann, Peter
–arXiv.org Artificial Intelligence
The performance of predictive models in clinical settings often degrades when deployed in new hospitals due to distribution shifts. This paper presents a large-scale study of causality-inspired domain generalization on heterogeneous multi-center intensive care unit (ICU) data. We apply anchor regression and introduce anchor boosting, a novel, tree-based nonlinear extension, to a large dataset comprising 400,000 patients from nine distinct ICU databases. The anchor regularization consistently improves out-of-distribution performance, particularly for the most dissimilar target domains. The methods appear robust to violations of theoretical assumptions, such as anchor exogeneity. Furthermore, we propose a novel conceptual framework to quantify the utility of large external data datasets. By evaluating performance as a function of available target-domain data, we identify three regimes: (i) a domain generalization regime, where only the external model should be used, (ii) a domain adaptation regime, where refitting the external model is optimal, and (iii) a data-rich regime, where external data provides no additional value.
arXiv.org Artificial Intelligence
Jul-30-2025
- Country:
- Asia
- China (0.04)
- Middle East > Israel (0.04)
- Europe
- Austria > Salzburg
- Salzburg (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Austria > Salzburg
- North America > United States
- Illinois > Cook County > Chicago (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Technology: