Interpretable Generalized Additive Models for Datasets with Missing Values

Jon Donnelly*
Department of Computer Science
Duke University

Neural Information Processing Systems 

Many important datasets contain samples that are missing one or more feature values. Maintaining the interpretability of machine learning models in the presence of such missing data is challenging. Singly or multiply imputing missing values complicates the model's mapping from features to labels. On the other hand, reasoning over indicator variables that represent missingness introduces a potentially large number of additional terms, sacrificing sparsity.
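To make the trade-off concrete, the following is a minimal sketch, not the paper's method: the DataFrame `X` and its column names are illustrative, and it contrasts the two standard strategies described above using plain pandas operations.

```python
import numpy as np
import pandas as pd

# Hypothetical feature matrix with NaNs marking missing values.
X = pd.DataFrame({
    "age": [34.0, np.nan, 51.0, 29.0],
    "bmi": [22.1, 30.4, np.nan, np.nan],
})

# Strategy 1: single (mean) imputation. Each filled value depends on the
# rest of the column, so a feature's shape function in a downstream GAM
# no longer reflects the observed data alone -- the interpretability cost.
X_imputed = X.fillna(X.mean())

# Strategy 2: missingness indicators. Every feature with any missing value
# gains an extra binary column, so a model fit on X_indicator can carry up
# to twice as many terms -- the sparsity cost.
indicators = X.isna().astype(int).add_suffix("_missing")
X_indicator = pd.concat([X.fillna(0.0), indicators], axis=1)

print(X_imputed)
print(X_indicator)
```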