Missing value imputation with adversarial random forests -- MissARF
Pegah Golchian, Jan Kapar, David S. Watson, Marvin N. Wright
Handling missing values is a common challenge in biostatistical analyses, typically addressed by imputation methods. We propose a novel, fast, and easy-to-use imputation method called missing value imputation with adversarial random forests (MissARF), based on generative machine learning, that provides both single and multiple imputation. MissARF employs adversarial random forests (ARF) for density estimation and data synthesis. To impute a missing value of an observation, we condition on the non-missing values and sample from the estimated conditional distribution generated by ARF. Our experiments demonstrate that MissARF performs comparably to state-of-the-art single and multiple imputation methods in terms of imputation quality, with fast runtime and no additional cost for multiple imputation.
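The conditioning-and-sampling step described in the abstract can be illustrated with a small sketch. This is not the MissARF implementation: a bivariate Gaussian stands in for the ARF density estimate, the model is fitted on complete rows only, and the function names (`fit_gaussian`, `sample_y_given_x`, `impute`) are hypothetical. What it shows is the general pattern: estimate a joint density, condition on the observed value, draw from the conditional distribution, and repeat the draw to obtain multiple imputations.

```python
import random
import statistics


def fit_gaussian(rows):
    """Estimate means, variances and covariance from complete (x, y) rows.

    Assumption: a bivariate Gaussian replaces the ARF density estimate.
    """
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    vx, vy = statistics.variance(xs), statistics.variance(ys)
    cxy = sum((x - mx) * (y - my) for x, y in rows) / (len(rows) - 1)
    return mx, my, vx, vy, cxy


def sample_y_given_x(x, params, rng):
    """Draw y from the Gaussian conditional distribution of y given x."""
    mx, my, vx, vy, cxy = params
    cond_mean = my + cxy / vx * (x - mx)
    # Guard against tiny negative values caused by floating-point rounding.
    cond_var = max(vy - cxy ** 2 / vx, 0.0)
    return rng.gauss(cond_mean, cond_var ** 0.5)


def impute(rows, m=1, seed=0):
    """Return m completed copies of rows; a missing y is marked by None.

    m = 1 corresponds to single imputation, m > 1 to multiple imputation.
    The density is fitted once, so extra imputations only cost extra draws.
    """
    rng = random.Random(seed)
    complete = [(x, y) for x, y in rows if y is not None]
    params = fit_gaussian(complete)
    return [
        [(x, y if y is not None else sample_y_given_x(x, params, rng))
         for x, y in rows]
        for _ in range(m)
    ]


# Toy data (made up for illustration): the fourth row has a missing y.
data = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, None), (5.0, 9.8)]
completed = impute(data, m=5, seed=1)
```

Each element of `completed` is one imputed dataset. Observed values are copied unchanged, and only the missing entry varies across the five copies, which is what lets downstream analyses reflect imputation uncertainty.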
Jul-22-2025
- Country:
- North America > United States
- Wyoming > Albany County > Laramie (0.14)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Germany > Bremen
- Bremen (0.14)
- Denmark > Capital Region
- Copenhagen (0.04)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Health & Medicine > Public Health (0.45)
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Statistical Learning (1.00)
- Ensemble Learning (1.00)
- Decision Tree Learning (1.00)
- Neural Networks > Deep Learning (0.93)