Locally Pareto-Optimal Interpretations for Black-Box Machine Learning Models
Joshi, Aniruddha, Chakraborty, Supratik, Akshay, S, Shah, Shetal, Torfah, Hazem, Seshia, Sanjit
–arXiv.org Artificial Intelligence
Creating meaningful interpretations for black-box machine learning models involves balancing two often conflicting objectives: accuracy and explainability. Exploring the trade-off between these objectives is essential for developing trustworthy interpretations. While many techniques for multi-objective interpretation synthesis have been developed, they typically lack formal guarantees on the Pareto-optimality of the results. Methods that do provide such guarantees, on the other hand, often face severe scalability limitations when exploring the Pareto-optimal space. To address this, we develop a framework based on local optimality guarantees that enables more scalable synthesis of interpretations. Specifically, we consider the problem of synthesizing a set of Pareto-optimal interpretations with local optimality guarantees, within the immediate neighborhood of each solution. Our approach begins with a multi-objective learning or search technique, such as Multi-Objective Monte Carlo Tree Search, to generate a best-effort set of Pareto-optimal candidates with respect to accuracy and explainability. We then verify local optimality for each candidate as a Boolean satisfiability problem, which we solve using a SAT solver. We demonstrate the efficacy of our approach on a set of benchmarks, comparing it against previous methods for exploring the Pareto-optimal front of interpretations. In particular, we show that our approach yields interpretations that closely match those synthesized by methods offering global guarantees.
arXiv.org Artificial Intelligence
Aug-22-2025
- Country:
- Asia > India (0.04)
- Europe
- Belgium > Wallonia
- Walloon Brabant > Louvain-la-Neuve (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Belgium > Wallonia
- North America
- Mexico > Quintana Roo
- Cancún (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Colorado > Denver County
- Denver (0.04)
- District of Columbia > Washington (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York > New York County
- New York City (0.14)
- California
- Mexico > Quintana Roo
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Health & Medicine > Therapeutic Area (1.00)
- Transportation > Air (0.73)
- Technology: