What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

Khosravi, Pasha, Liang, Yitao, Choi, YooJung, Broeck, Guy Van den

Mar-4-2019–arXiv.org Artificial Intelligence

While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge. Classifiers may not behave as expected under certain ways of substituting the missing values, since they inherently make assumptions about the data distribution they were trained on. In this paper, we propose a novel framework that classifies examples with missing features by computing the expected prediction on a given feature distribution. We then use geometric programming to learn a naive Bayes distribution that embeds a given logistic regression classifier and can efficiently take its expected predictions. Empirical evaluations show that our model achieves the same performance as the logistic regression with all features observed, and outperforms standard imputation techniques when features go missing during prediction time. Furthermore, we demonstrate that our method can be used to generate 'sufficient explanations' of logistic regression classifications, by removing features that do not affect the classification.

artificial intelligence, classifier, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Mar-4-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California
    - Los Angeles County > Los Angeles (0.14)
    - San Francisco County > San Francisco (0.04)
- Asia > Middle East
  - Jordan (0.05)

Genre:
- Research Report
  - New Finding (0.92)
  - Experimental Study (0.78)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Regression (1.00)
  - Learning Graphical Models > Directed Networks
    - Bayesian Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found