interpretable machine learning
On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach
Interpretable and explainable machine learning has seen a recent surge of interest. We focus on safety as a key motivation behind the surge and make the relationship between interpretability and safety more quantitative. Toward assessing safety, we introduce the concept of maximum deviation via an optimization problem to find the largest deviation of a supervised learning model from a reference model regarded as safe. We then show how interpretability facilitates this safety assessment. For models including decision trees, generalized linear and additive models, the maximum deviation can be computed exactly and efficiently. For tree ensembles, which are not regarded as interpretable, discrete optimization techniques can still provide informative bounds. For a broader class of piecewise Lipschitz functions, we leverage the multi-armed bandit literature to show that interpretability produces tighter (regret) bounds on the maximum deviation. We present case studies, including one on mortgage approval, to illustrate our methods and the insights about models that may be obtained from deviation maximization.
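As a rough worked illustration of why the additive case is easy (a sketch under our own assumptions, not the paper's algorithm): if both the model and the reference are additive over a product domain, the deviation decomposes coordinate-wise, so the largest deviation is a sum of per-feature maxima. The shape-function grids and function name below are illustrative.

```python
import numpy as np

def max_deviation_additive(f_shapes, g_shapes):
    """Maximum absolute deviation of an additive model f from an additive
    reference g over a product domain.

    f_shapes, g_shapes: lists of 1-D arrays; f_shapes[j][k] is the value of
    the j-th shape function at the k-th grid point of feature j. Because both
    models are additive and the domain is a product of the per-feature grids,
    max_x (f(x) - g(x)) = sum_j max_k (f_j[k] - g_j[k]), and likewise for the
    minimum.
    """
    diffs = [f_j - g_j for f_j, g_j in zip(f_shapes, g_shapes)]
    upper = sum(d.max() for d in diffs)   # largest positive deviation
    lower = sum(d.min() for d in diffs)   # largest negative deviation
    return max(abs(upper), abs(lower))    # maximum absolute deviation

# Toy example: two features, reference model identically zero.
f_shapes = [np.array([-0.5, 0.0, 1.2]), np.array([0.3, -0.1])]
g_shapes = [np.zeros(3), np.zeros(2)]
print(max_deviation_additive(f_shapes, g_shapes))  # 1.5
```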
Neural Additive Models: Interpretable Machine Learning with Neural Nets
Deep neural networks (DNNs) are powerful black-box predictors that have achieved impressive performance on a wide variety of tasks. However, their accuracy comes at the cost of intelligibility: it is usually unclear how they make their decisions. This hinders their applicability to high stakes decision-making domains such as healthcare. We propose Neural Additive Models (NAMs) which combine some of the expressivity of DNNs with the inherent intelligibility of generalized additive models. NAMs learn a linear combination of neural networks that each attend to a single input feature. These networks are trained jointly and can learn arbitrarily complex relationships between their input feature and the output. Our experiments on regression and classification datasets show that NAMs are more accurate than widely used intelligible models such as logistic regression and shallow decision trees. They perform similarly to existing state-of-the-art generalized additive models in accuracy, but are more flexible because they are based on neural nets instead of boosted trees. To demonstrate this, we show how NAMs can be used for multitask learning on synthetic data and on the COMPAS recidivism data due to their composability, and demonstrate that the differentiability of NAMs allows them to train more complex interpretable models for COVID-19.
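A minimal PyTorch sketch of the NAM idea as described above, with one small subnetwork per input feature whose scalar outputs are summed with a bias; it omits details of the published implementation (e.g. ExU hidden units, feature dropout), and the class name is ours.

```python
import torch
import torch.nn as nn

class TinyNAM(nn.Module):
    """Minimal Neural Additive Model sketch: one small MLP per input feature,
    whose scalar outputs are summed together with a global bias."""

    def __init__(self, num_features: int, hidden: int = 32):
        super().__init__()
        self.feature_nets = nn.ModuleList([
            nn.Sequential(nn.Linear(1, hidden), nn.ReLU(), nn.Linear(hidden, 1))
            for _ in range(num_features)
        ])
        self.bias = nn.Parameter(torch.zeros(1))

    def forward(self, x):  # x: (batch, num_features)
        contributions = [net(x[:, j:j + 1]) for j, net in enumerate(self.feature_nets)]
        return torch.cat(contributions, dim=1).sum(dim=1, keepdim=True) + self.bias

model = TinyNAM(num_features=5)
y_hat = model(torch.randn(8, 5))  # (8, 1) predictions
```

Because each feature's contribution is the output of its own subnetwork, a per-example explanation can be read off directly from the individual contributions before they are summed.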
Figure 5: A toy example of this transformation. Figure 6: Jointly optimizing the decision policy and the counterfactual explanations can offer additional gains.
Loan-applicant features: Employment Length (how long the applicant has been employed); FICO Score (the applicant's FICO score, a credit score based on consumer credit); Annual Income (the declared annual income of the applicant); Marital Status (whether the person is married or single).
ExplainBench: A Benchmark Framework for Local Model Explanations in Fairness-Critical Applications
As machine learning systems are increasingly deployed in high-stakes domains such as criminal justice, finance, and healthcare, the demand for interpretable and trustworthy models has intensified. Despite the proliferation of local explanation techniques, including SHAP, LIME, and counterfactual methods, there exists no standardized, reproducible framework for their comparative evaluation, particularly in fairness-sensitive settings. We introduce ExplainBench, an open-source benchmarking suite for systematic evaluation of local model explanations across ethically consequential datasets. ExplainBench provides unified wrappers for popular explanation algorithms, integrates end-to-end pipelines for model training and explanation generation, and supports evaluation via fidelity, sparsity, and robustness metrics. The framework includes a Streamlit-based graphical interface for interactive exploration and is packaged as a Python module for seamless integration into research workflows. We demonstrate ExplainBench on datasets commonly used in fairness research, such as COMPAS, UCI Adult Income, and LendingClub, and showcase how different explanation methods behave under a shared experimental protocol. By enabling reproducible, comparative analysis of local explanations, ExplainBench advances the methodological foundations of interpretable machine learning and facilitates accountability in real-world AI systems.
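ExplainBench's own metric implementations are not reproduced here; the following is a hedged sketch of how fidelity and sparsity are commonly scored for a single local explanation. The function names, top-k masking scheme, and baseline argument are assumptions for illustration, not the ExplainBench API.

```python
import numpy as np

def fidelity(model_predict, x, attributions, baseline, top_k=3):
    """One common notion of local-explanation fidelity: how much the model's
    output drops when the top-k attributed features are replaced by baseline
    values. Larger drops mean the explanation flagged influential features."""
    attributions = np.asarray(attributions)
    order = np.argsort(-np.abs(attributions))[:top_k]  # most-attributed features
    x_masked = x.copy()
    x_masked[order] = baseline[order]
    return float(model_predict(x[None, :])[0] - model_predict(x_masked[None, :])[0])

def sparsity(attributions, eps=1e-6):
    """Fraction of features with (near-)zero attribution; higher is sparser."""
    attributions = np.asarray(attributions)
    return float(np.mean(np.abs(attributions) < eps))
```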
- Law (0.49)
- Health & Medicine (0.49)
- Information Technology (0.46)
- Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Interpretable Machine Learning for Macro Alpha: A News Sentiment Case Study
This study introduces an interpretable machine learning (ML) framework to extract macroeconomic alpha from global news sentiment. We process the Global Database of Events, Language, and Tone (GDELT) Project's worldwide news feed using FinBERT -- a Bidirectional Encoder Representations from Transformers (BERT) based model pretrained on finance-specific language -- to construct daily sentiment indices incorporating mean tone, dispersion, and event impact. These indices drive an XGBoost classifier, benchmarked against logistic regression, to predict next-day returns for EUR/USD, USD/JPY, and 10-year U.S. Treasury futures (ZN). Rigorous out-of-sample (OOS) backtesting (5-fold expanding-window cross-validation, OOS period: c. 2017-April 2025) demonstrates exceptional, cost-adjusted performance for the XGBoost strategy: Sharpe ratios achieve 5.87 (EUR/USD), 4.65 (USD/JPY), and 4.65 (Treasuries), with respective compound annual growth rates (CAGRs) exceeding 50% in Foreign Exchange (FX) and 22% in bonds. Shapley Additive Explanations (SHAP) affirm that sentiment dispersion and article impact are key predictive features. Our findings establish that integrating domain-specific Natural Language Processing (NLP) with interpretable ML offers a potent and explainable source of macro alpha.
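A minimal sketch of the modeling stage, assuming the FinBERT sentiment features have already been computed (synthetic stand-ins are used here): an XGBoost classifier scored under an expanding-window time-series split in the spirit of the paper's 5-fold protocol. Hyperparameters and feature names are illustrative, not the study's settings.

```python
import numpy as np
import xgboost as xgb
from sklearn.model_selection import TimeSeriesSplit
from sklearn.metrics import accuracy_score

# X: daily sentiment features (e.g. mean tone, dispersion, event impact);
# y: 1 if the next-day return is positive, else 0. Synthetic stand-ins below.
rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 3))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.5, size=2000) > 0).astype(int)

scores = []
for train_idx, test_idx in TimeSeriesSplit(n_splits=5).split(X):
    clf = xgb.XGBClassifier(n_estimators=200, max_depth=3, learning_rate=0.05)
    clf.fit(X[train_idx], y[train_idx])           # expanding training window
    scores.append(accuracy_score(y[test_idx], clf.predict(X[test_idx])))
print(np.mean(scores))
```

In the study, SHAP values computed on the fitted trees are then used to attribute predictions to the sentiment features, which is how the dispersion and impact indices are identified as the key drivers.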
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Banking & Finance > Trading (1.00)
- Banking & Finance > Economy (1.00)
Neural-ANOVA: Model Decomposition for Interpretable Machine Learning
Steffen Limmer, Steffen Udluft, Clemens Otte
The analysis of variance (ANOVA) decomposition offers a systematic method to understand the interaction effects that contribute to a specific decision output. In this paper we introduce Neural-ANOVA, an approach to decompose neural networks into glassbox models using the ANOVA decomposition. Our approach formulates a learning problem, which enables rapid and closed-form evaluation of integrals over subspaces that appear in the calculation of the ANOVA decomposition. Finally, we conduct numerical experiments to illustrate the advantages of enhanced interpretability and model validation by a decomposition of the learned interaction effects.
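Neural-ANOVA's contribution is a learning formulation that makes the required subspace integrals available in closed form for a trained network; the sketch below only illustrates the underlying functional ANOVA decomposition it builds on, using plain Monte Carlo integration on the unit cube. The toy target and function names are illustrative.

```python
import numpy as np

def anova_first_order(f, n_samples=20000, d=2, seed=0):
    """Classical functional ANOVA on [0, 1]^d, estimated by Monte Carlo:
    f(x) ~= f0 + sum_i f_i(x_i) + higher-order interaction terms,
    where f0 = E[f] and f_i(x_i) = E[f | x_i] - f0."""
    rng = np.random.default_rng(seed)
    X = rng.uniform(size=(n_samples, d))
    f0 = f(X).mean()                      # constant (grand-mean) term

    def f_i(i, xi, n_inner=5000):
        Z = rng.uniform(size=(n_inner, d))
        Z[:, i] = xi                      # clamp coordinate i, average the rest
        return f(Z).mean() - f0           # main effect of feature i at value xi

    return f0, f_i

# Toy target: f(x) = x0 + 2*x1 + x0*x1 (the interaction is excluded from the
# first-order main effects by construction).
f = lambda X: X[:, 0] + 2 * X[:, 1] + X[:, 0] * X[:, 1]
f0, f_i = anova_first_order(f)
print(f0, f_i(0, 0.9), f_i(1, 0.9))       # approx. 1.75, 0.6, 1.0
```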