Optimal ablation for interpretability

Neural Information Processing Systems 

Interpretability work in machine learning (ML) seeks to develop tools that make models more intelligible to humans in order to better monitor model behavior and predict failure modes. Early work in interpretability sought to identify relationships between model outputs and input features (Ribeiro et al., 2016; Covert et al., 2022), but with only black-box query access to observe inputs and outputs, it can be difficult to evaluate a model's internal logic.
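The input-attribution approach described above can be illustrated with a minimal sketch. This is a hypothetical example, not the paper's method: a toy black-box model is probed by ablating (replacing with a baseline value) one input feature at a time and recording the resulting change in output, in the spirit of occlusion-style attribution.

```python
import numpy as np

def black_box_model(x):
    """Toy stand-in for a model we can only query, not inspect."""
    weights = np.array([2.0, -1.0, 0.5])  # hidden from the analyst
    return float(weights @ x)

def occlusion_attribution(model, x, baseline=0.0):
    """Score each input feature by the output drop when it is
    replaced with a baseline value (black-box queries only)."""
    base_out = model(x)
    scores = []
    for i in range(len(x)):
        ablated = x.copy()
        ablated[i] = baseline      # ablate feature i
        scores.append(base_out - model(ablated))
    return scores

x = np.array([1.0, 1.0, 1.0])
print(occlusion_attribution(black_box_model, x))  # → [2.0, -1.0, 0.5]
```

Each score reflects only the input-output relationship; as the passage notes, such black-box probes reveal nothing about the model's internal computation.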
