Unboxing the Black Box: Mechanistic Interpretability for Algorithmic Understanding of Neural Networks
Kowalska, Bianka, Kwaśnicka, Halina
arXiv.org Artificial Intelligence
Artificial intelligence (AI) increasingly assists us in a wide range of tasks, from everyday applications like recommendation systems to high-risk domains such as biometric recognition, autonomous vehicles, and medical diagnosis [1]. In particular, the rise of transformer-based models, such as those used in natural language processing (NLP), has significantly accelerated AI's adoption and visibility in society, enabling breakthroughs in fields like text generation, translation, and image understanding [2]. The size, complexity, and opacity of deep learning models are growing exponentially, outpacing researchers' ability to understand the black box. As deep neural networks are deployed in real-world applications with increasingly advanced use cases, the impact of AI continues to grow. This growing influence, coupled with the opaque, black-box nature of most AI systems, has heightened the demand for AI models that are both faithful and explainable. Validating AI's decisions is especially critical in high-risk areas such as law and medicine [3, 4]. As a result, Explainable AI (XAI) emerged as a direct response to companies' and researchers' demands to interpret, explain, and validate neural networks in order to make AI systems trustworthy. XAI encompasses all methods, approaches, and efforts to uncover the reasoning and behavior of artificial intelligence systems [1]. It is therefore important to establish an understanding of common terms used in the XAI literature, despite the lack of universally accepted definitions.
Nov-25-2025
- Country:
- Africa > Rwanda
- Asia
- China > Hong Kong (0.04)
- Indonesia > Bali (0.04)
- Middle East
- Republic of Türkiye > Karaman Province
- Karaman (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Singapore (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Austria > Vienna (0.14)
- France > Île-de-France
- Italy > Tuscany
- Florence (0.04)
- Poland > Lower Silesia Province
- Wroclaw (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Palo Alto (0.04)
- District of Columbia > Washington (0.04)
- Florida > Miami-Dade County
- Miami (0.14)
- Illinois > Cook County
- Chicago (0.04)
- Maryland (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York (0.04)
- North Carolina > Wake County
- Raleigh (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Oceania > Australia
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine (1.00)
- Transportation > Air (0.80)
- Technology: