Calibrated Explanations: with Uncertainty Information and Counterfactuals

Lofstrom, Helena, Lofstrom, Tuwe, Johansson, Ulf, Sonstrod, Cecilia

Nov-4-2023–arXiv.org Artificial Intelligence

Predictive models used for AI-based decision support are generally not designed for transparency. Although they operate in critical situations such as, e.g., medicine or defence, they are limited to only presenting a probable outcome (David Gunning, 2017; Ribeiro et al., 2016), which can lead to either misuse (based on user reliance being higher than appropriate) or disuse (due to users having less reliance than appropriate) (Alvarado-Valencia & Barrero, 2014; Buçinca et al., 2020). Due to the lack of transparency, predictions from this type of model often require an explanation. In explainable artificial intelligence (XAI), the goal is to create methods that help human users identify when to trust a prediction and when not to, such as an erroneous prediction in a medical diagnosis (Marx et al., 2023). An explanation should reveal the strengths and weaknesses of the underlying model to communicate how they will behave in the future (David Gunning, 2017; Dimanov et al., 2020). There are two main categories of explanations: local explanations, which present information about the reasons for individual predictions, and global explanations, which provide information about the general behaviour of the model (Guidotti et al., 2018b; Moradi & Samwald, 2021; Martens & Foster, 2014).

explanation, prediction, probability estimate, (14 more...)

arXiv.org Artificial Intelligence

Nov-4-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > New York
    - New York County > New York City (0.04)
  - Canada > Ontario
    - National Capital Region > Ottawa (0.04)
- Europe > Sweden
  - Jönköping County > Jönköping (0.04)

Genre:
- Research Report (0.64)

Industry:
- Health & Medicine > Therapeutic Area (0.69)
- Government > Regional Government (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language > Explanation & Argumentation (1.00)
    - Machine Learning (1.00)
    - Issues > Social & Ethical Issues (0.87)