Coverage-based Outlier Explanation
Yue Wu, Leman Akoglu, Ian Davidson
arXiv.org Artificial Intelligence
Outlier detection is a core task in data mining, with a plethora of algorithms that have enjoyed wide-scale usage. Existing algorithms focus primarily on detection, that is, the identification of outliers in a given dataset. In this paper we explore the relatively under-studied problem of outlier explanation. Our goal is, given a dataset that is already divided into outliers and normal instances, to explain what characterizes the outliers. We explore the novel direction of a semantic explanation that a domain expert or policy maker is able to understand. We formulate this as an optimization problem to find explanations that are both interpretable and pure. Through experiments on real-world datasets, we quantitatively show that our method can efficiently generate better explanations than rule-based learners.
Nov-6-2019
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California > Yolo County > Davis (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (0.69)
- Automobiles & Trucks (0.46)