Intuitively Assessing ML Model Reliability through Example-Based Explanations and Editing Model Inputs
Suresh, Harini, Lewis, Kathleen M., Guttag, John V., Satyanarayan, Arvind
–arXiv.org Artificial Intelligence
Interpretability methods aim to help users build trust in and understand the capabilities of machine learning models. However, existing approaches often rely on abstract, complex visualizations that poorly map to the task at hand or require non-trivial ML expertise to interpret. Here, we present two interface modules to facilitate a more intuitive assessment of model reliability. To help users better characterize and reason about a model's uncertainty, we visualize raw and aggregate information about a given input's nearest neighbors in the training dataset. Using an interactive editor, users can manipulate this input in semantically-meaningful ways, determine the effect on the output, and compare against their prior expectations. We evaluate our interface using an electrocardiogram beat classification case study. Compared to a baseline feature importance interface, we find that 9 physicians are better able to align the model's uncertainty with clinically relevant factors and build intuition about its capabilities and limitations.
arXiv.org Artificial Intelligence
Feb-16-2021
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America > United States
- Texas > Dallas County
- Dallas (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Georgia > Fulton County
- Atlanta (0.14)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- Texas > Dallas County
- Europe
- United Kingdom > England
- Hampshire > Southampton (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Piedmont
- Turin Province > Turin (0.04)
- United Kingdom > England
- Oceania > Australia
- Genre:
- Research Report > Experimental Study (0.46)
- Industry:
- Technology: