Explainability Is in the Mind of the Beholder: Establishing the Foundations of Explainable Artificial Intelligence

Dec-29-2021–arXiv.org Artificial Intelligence

Explainable artificial intelligence and interpretable machine learning are research fields growing in importance. Yet, the underlying concepts remain somewhat elusive and lack generally agreed definitions. While recent inspiration from social sciences has refocused the work on needs and expectations of human recipients, the field still misses a concrete conceptualisation. We take steps towards addressing this challenge by reviewing the philosophical and social foundations of human explainability, which we then translate into the technological realm. In particular, we scrutinise the notion of algorithmic black boxes and the spectrum of understanding determined by explanatory processes and explainees' background knowledge. This approach allows us to define explainability as (logical) reasoning applied to transparent insights (into black boxes) interpreted under certain background knowledge - a process that engenders understanding in explainees. We then employ this conceptualisation to revisit the much disputed trade-off between transparency and predictive power and its implications for ante-hoc and post-hoc explainers as well as fairness and accountability engendered by explainability. We furthermore discuss components of the machine learning workflow that may be in need of interpretability, building on a range of ideas from human-centred explainability, with a focus on explainees, contrastive statements and explanatory processes. Our discussion reconciles and complements current research to help better navigate open questions - rather than attempting to address any individual issue - thus laying a solid foundation for a grounded discussion and future progress of explainable artificial intelligence and interpretable machine learning. We conclude with a summary of our findings, revisiting the human-centred explanatory process needed to achieve the desired level of algorithmic transparency.

explainability, explainee, explanation, (11 more...)

arXiv.org Artificial Intelligence

Dec-29-2021

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - Virginia > Arlington County
      - Arlington (0.04)
    - New York > New York County
      - New York City (0.14)
    - California
      - San Francisco County > San Francisco (0.14)
      - Los Angeles County > Long Beach (0.04)
      - San Diego County > San Diego (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - Nova Scotia > Halifax Regional Municipality
      - Halifax (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Bristol (0.04)
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Apulia
    - Bari (0.04)
  - Belgium > Flanders
    - East Flanders > Ghent (0.04)
- Asia
  - Macao (0.04)
  - China (0.04)

Genre:
- Overview (1.00)
- Research Report > New Finding (0.34)

Industry:
- Health & Medicine > Therapeutic Area (0.46)
- Government > Regional Government
  - North America Government > United States Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Explanation & Argumentation (1.00)
  - Machine Learning (1.00)
  - Issues > Social & Ethical Issues (1.00)