Learning to Reject Low-Quality Explanations via User Feedback
Stradiotti, Luca, Pesenti, Dario, Teso, Stefano, Davis, Jesse
–arXiv.org Artificial Intelligence
Machine Learning predictors are increasingly being employed in high-stakes applications such as credit scoring. Explanations help users unpack the reasons behind their predictions, but are not always "high quality''. That is, end-users may have difficulty interpreting or believing them, which can complicate trust assessment and downstream decision-making. We argue that classifiers should have the option to refuse handling inputs whose predictions cannot be explained properly and introduce a framework for learning to reject low-quality explanations (LtX) in which predictors are equipped with a rejector that evaluates the quality of explanations. In this problem setting, the key challenges are how to properly define and assess explanation quality and how to design a suitable rejector. Focusing on popular attribution techniques, we introduce ULER (User-centric Low-quality Explanation Rejector), which learns a simple rejector from human ratings and per-feature relevance judgments to mirror human judgments of explanation quality. Our experiments show that ULER outperforms both state-of-the-art and explanation-aware learning to reject strategies at LtX on eight classification and regression benchmarks and on a new human-annotated dataset, which we will publicly release to support future research.
arXiv.org Artificial Intelligence
Jul-21-2025
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Belgium > Flanders
- Flemish Brabant > Leuven (0.04)
- France (0.04)
- Germany (0.04)
- Italy > Trentino-Alto Adige/Südtirol
- Trentino Province > Trento (0.04)
- Netherlands (0.04)
- Portugal > Braga
- Braga (0.04)
- Spain (0.04)
- United Kingdom > England (0.04)
- Belgium > Flanders
- North America > United States
- Georgia > Fulton County > Atlanta (0.04)
- Asia > Middle East
- Genre:
- Questionnaire & Opinion Survey (0.94)
- Research Report > Experimental Study (1.00)
- Industry:
- Health & Medicine > Diagnostic Medicine
- Imaging (0.46)
- Leisure & Entertainment > Sports
- Soccer (0.46)
- Health & Medicine > Diagnostic Medicine
- Technology: