The FIX Benchmark: Extracting Features Interpretable to eXperts

Jin, Helen, Havaldar, Shreya, Kim, Chaehyeon, Xue, Anton, You, Weiqiu, Qu, Helen, Gatti, Marco, Hashimoto, Daniel A, Jain, Bhuvnesh, Madani, Amin, Sako, Masao, Ungar, Lyle, Wong, Eric

Dec-23-2024–arXiv.org Artificial Intelligence

Feature-based methods are commonly used to explain model predictions, but these methods often implicitly assume that interpretable features are readily available. However, this is often not the case for high-dimensional data, and it can be hard even for domain experts to mathematically specify which features are important. Can we instead automatically extract collections or groups of features that are aligned with expert knowledge? To address this gap, we present FIX (Features Interpretable to eXperts), a benchmark for measuring how well a collection of features aligns with expert knowledge. In collaboration with domain experts, we propose FIXScore, a unified expert alignment measure applicable to diverse real-world settings across cosmology, psychology, and medicine domains in vision, language, and time series data modalities. With FIXScore, we find that popular feature-based explanation methods have poor alignment with expert-specified knowledge, highlighting the need for new methods that can better identify features interpretable to experts.

data mining, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

Dec-23-2024

arXiv.org PDF

Add feedback

Country:
- South America > Uruguay
  - Maldonado > Maldonado (0.04)
- North America
  - United States
    - Pennsylvania (0.04)
    - New York > New York County
      - New York City (0.04)
    - California > Santa Clara County
      - Palo Alto (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - Toronto (0.14)
- Europe
  - Switzerland > Zürich
    - Zürich (0.14)
  - Germany > North Rhine-Westphalia
    - Upper Bavaria > Munich (0.04)
  - France > Grand Est
    - Bas-Rhin > Strasbourg (0.04)
- Asia
  - Singapore (0.04)
  - Japan (0.04)
  - Indonesia > Bali (0.04)

Genre:
- Overview (0.67)

Industry:
- Law (0.67)
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Therapeutic Area > Gastroenterology (0.68)

Technology:
- Information Technology
  - Data Science > Data Mining (0.93)
  - Sensing and Signal Processing > Image Processing (0.93)
  - Communications > Social Media (0.68)
  - Human Computer Interaction (0.67)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language (1.00)
    - Cognitive Science (0.67)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found