AITopics | interpretability method

b4aadf04d6fde46346db455402860708-Paper-Conference.pdf

Neural Information Processing Systems | Apr-29-2026, 12:24:53 GMT

artificial intelligence, interpretability, machine learning, (18 more...)

Country:

North America > United States (0.46)
Europe > Germany (0.28)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Neural Information Processing Systems | Feb-17-2026, 14:32:05 GMT

Any explanation that faithfully explains this type of model needs to be in agreement with this invariance property.

data mining, interpretability method, machine learning, (21 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > Florida > Broward County > Fort Lauderdale (0.04)
(2 more...)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Scale Alone Does not Improve Mechanistic Interpretability in Vision Models

Neural Information Processing Systems | Feb-16-2026, 16:33:35 GMT

In light of the recent widespread adoption of AI systems, understanding the internal information processing of neural networks has become increasingly critical.

artificial intelligence, interpretability, machine learning, (16 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.05)
Oceania > New Zealand (0.04)
(5 more...)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

A Benchmark for Interpretability Methods in Deep Neural Networks

Sara Hooker, Dumitru Erhan, Pieter-Jan Kindermans, Been Kim

Neural Information Processing Systems | Feb-15-2026, 09:42:23 GMT

Neural Information Processing Systems http://nips.cc/

accuracy, base estimator, estimator, (14 more...)

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)
(2 more...)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.85)

Learning outside the Black-Box: The pursuit of interpretable models

Neural Information Processing Systems | Feb-10-2026, 11:14:28 GMT

Machine Learning has proved its ability to produce accurate models, but the deployment of these models outside the machine learning community has been hindered by the difficulty of interpreting them.

artificial intelligence, machine learning, meijer g-function, (17 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Transportation > Air (0.44)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

47a3893cc405396a5c30d91320572d6d-Paper.pdf

Neural Information Processing Systems | Feb-8-2026, 07:15:42 GMT

dataset, saliency method, time step, (13 more...)

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > Maryland (0.04)
North America > Canada (0.04)
Europe > Italy > Marche > Ancona Province > Ancona (0.04)

Industry: Health & Medicine > Health Care Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

FIND: A Function Description Benchmark for Evaluating Interpretability Methods

Neural Information Processing Systems | Dec-27-2025, 04:13:01 GMT

Labeling neural network submodules with human-legible descriptions is useful for many downstream tasks: such descriptions can surface failures, guide interventions, and perhaps even explain important model behaviors. To date, most mechanistic descriptions of trained networks have involved small models, narrowly delimited phenomena, and large amounts of human labor. Labeling all human-interpretable sub-computations in models of increasing size and complexity will almost certainly require tools that can generate and validate descriptions automatically. Recently, techniques that use learned models in-the-loop for labeling have begun to gain traction, but methods for evaluating their efficacy are limited and ad-hoc. How should we validate and compare open-ended labeling tools?

function description benchmark, interpretability method, name change, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Neural Information Processing Systems | Dec-27-2025, 00:43:07 GMT

Interpretability methods are valuable only if their explanations faithfully describe the explained model. In this work, we consider neural networks whose predictions are invariant under a specific symmetry group. This includes popular architectures, ranging from convolutional to graph neural networks. Any explanation that faithfully explains this type of model needs to be in agreement with this invariance property.
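
The agreement between an explanation and a model's symmetry can be illustrated with a toy check. The sketch below is not the paper's metric or code: the sum-pooling model, the two attribution functions, and the cosine-based `equivariance_score` are all assumptions chosen for illustration. It builds a permutation-invariant model, then tests whether explaining a permuted input gives the same result as permuting the explanation.

```python
import math
import random

def model(x):
    # Toy permutation-invariant model (assumed): shared nonlinearity, then sum pooling.
    return sum(math.tanh(v) for v in x)

def gradient_attribution(x):
    # Analytic gradient of the toy model: d/dx_i sum(tanh(x)) = 1 - tanh(x_i)^2.
    return [1.0 - math.tanh(v) ** 2 for v in x]

def positional_attribution(x):
    # Contrast case (hypothetical): weights each input by its position,
    # so it ignores the model's permutation symmetry.
    return [v * (i + 1) for i, v in enumerate(x)]

def permute(values, perm):
    # Apply a permutation given as a list of source indices.
    return [values[j] for j in perm]

def cosine(a, b):
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    return dot / (norm_a * norm_b)

def equivariance_score(explain, x, perms):
    # Mean cosine similarity between explain(g.x) and g.explain(x);
    # a value near 1 means the explanation transforms with the input.
    scores = [cosine(explain(permute(x, p)), permute(explain(x), p)) for p in perms]
    return sum(scores) / len(scores)

rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(8)]
perms = [rng.sample(range(8), 8) for _ in range(20)]

# The model output should not change under any sampled permutation.
is_invariant = all(
    math.isclose(model(permute(x, p)), model(x), abs_tol=1e-12) for p in perms
)
good_score = equivariance_score(gradient_attribution, x, perms)
bad_score = equivariance_score(positional_attribution, x, perms)
print(is_invariant, good_score, bad_score)
```

In this toy setting the gradient attribution follows the permutation, while the position-weighted one does not, so its score drops below that of the gradient attribution.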

explanation invariance and equivariance, interpretability method, symmetry group, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)

A Benchmark for Interpretability Methods in Deep Neural Networks

Neural Information Processing Systems | Dec-26-2025, 04:37:25 GMT

We propose an empirical measure of the approximate accuracy of feature importance estimates in deep neural networks. Our results across several large-scale image classification datasets show that many popular interpretability methods produce estimates of feature importance that are not better than a random designation of feature importance.
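
The comparison against a random designation of feature importance can be sketched on a toy problem. This listing does not describe the paper's actual measure, so the following is only an assumed removal-based check: the linear model, the absolute-weight importance estimate, and the helper `mean_output_change` are all hypothetical. It zeroes the features an estimate ranks highest and compares the effect with zeroing the same number of randomly chosen features.

```python
import random

rng = random.Random(0)
d, k, n = 20, 5, 200

# Toy linear model (assumed): only the first five features carry weight.
weights = [3.0, -2.5, 2.0, -1.5, 1.0] + [0.0] * (d - 5)
inputs = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]

def model(x):
    return sum(w * v for w, v in zip(weights, x))

def mean_output_change(removed):
    # Average absolute change in the output when the chosen features are zeroed.
    removed = set(removed)
    total = 0.0
    for x in inputs:
        x_removed = [0.0 if i in removed else v for i, v in enumerate(x)]
        total += abs(model(x) - model(x_removed))
    return total / n

# Hypothetical importance estimate: absolute weight per feature.
importance = [abs(w) for w in weights]
top_k = sorted(range(d), key=lambda i: importance[i], reverse=True)[:k]

# Effect of removing the k features ranked most important by the estimate.
change_estimate = mean_output_change(top_k)

# Baseline: average effect of removing k features chosen uniformly at random.
random_changes = [mean_output_change(rng.sample(range(d), k)) for _ in range(50)]
change_random = sum(random_changes) / len(random_changes)

print(top_k, change_estimate, change_random)
```

An estimate that is no better than random would produce a change close to the random baseline; here the estimate matches the known weights by construction, so removing its top-ranked features changes the output more.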

deep neural network, interpretability method, name change, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)
