Error Discovery by Clustering Influence Embeddings

Wang, Fulton, Adebayo, Julius, Tan, Sarah, Garcia-Olano, Diego, Kokhlikyan, Narine

Dec-7-2023–arXiv.org Artificial Intelligence

We present a method for identifying groups of test examples -- slices -- on which a model under-performs, a task now known as slice discovery. We formalize coherence -- a requirement that erroneous predictions, within a slice, should be wrong for the same reason -- as a key property that any slice discovery method should satisfy. We then use influence functions to derive a new slice discovery method, InfEmbed, which satisfies coherence by returning slices whose examples are influenced similarly by the training data. InfEmbed is simple, and consists of applying K-Means clustering to a novel representation we deem influence embeddings. We show InfEmbed outperforms current state-of-the-art methods on 2 benchmarks, and is effective for model debugging across several case studies.

dataset, infembed, prediction, (15 more...)

arXiv.org Artificial Intelligence

Dec-7-2023

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom (0.04)
- North America > United States
  - New Mexico > Bernalillo County > Albuquerque (0.04)
- Asia > British Indian Ocean Territory
  - Diego Garcia (0.04)
- Africa
  - Côte d'Ivoire (0.04)
  - Nigeria > Lagos State
    - Lagos (0.04)

Genre:
- Research Report (0.83)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Nuclear Medicine (0.92)
  - Epidemiology (0.68)
  - Therapeutic Area
    - Infections and Infectious Diseases (1.00)
    - Immunology (1.00)
    - Pulmonary/Respiratory Diseases (0.68)
- Government > Regional Government
  - North America Government > United States Government (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.67)
  - Statistical Learning > Clustering (0.66)