On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

May-27-2025, 01:03:53 GMT–Neural Information Processing Systems

Interpretable and explainable machine learning has seen a recent surge of interest. We focus on safety as a key motivation behind the surge and make the relationship between interpretability and safety more quantitative. Toward assessing safety, we introduce the concept of maximum deviation via an optimization problem to find the largest deviation of a supervised learning model from a reference model regarded as safe. We then show how interpretability facilitates this safety assessment. For models including decision trees, generalized linear and additive models, the maximum deviation can be computed exactly and efficiently.

artificial intelligence, interpretable machine learning, optimization problem, (2 more...)

Neural Information Processing Systems

May-27-2025, 01:03:53 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Optimization (0.63)