A Trustworthy Industrial Fault Diagnosis Architecture Integrating Probabilistic Models and Large Language Models

wu, Yue

arXiv.org Artificial Intelligence 

Abstract: Addressing the core problem of insufficient trustworthiness in industrial fault diagnosis, stemming from the limitations of existing methods -- both traditional and deep learning - based -- in terms of interpretability, generalization, and uncertainty quantification, this paper proposes a trustworthy industrial fault diagnosis architecture, the Hierarchical Cognitive Arbitration Architecture (HCAA), which integrates probabilistic models with Large Language Models (LLMs). The architecture conducts a preliminary analysis via a diagnostic engine based on a Bayesian network and features an LLM - driven cognitive arbitration module with multimodal input capabilities. This module performs expert - level arbitration on the initial diagnosis by analyzing structured features and diagnostic charts, holding the priority to make the final decision upon detecting conflicts. To ensure the reliability of the system's output, the architecture integrates a confidence calibration module based on Temperature Scaling and a risk assessment module, which objectively quantify system trustworthiness using metrics like Expected Calibration Error (ECE). Experimental results on a dataset containing multiple fault types demonstrate that the proposed framework improves diagnostic accuracy by over 28 percentage points compared to baseline models, while the post - calibration ECE is reduced by more than 75%. Case studies confirm that the HCAA effectively corrects misjudgments from traditional models caused by complex feature patterns or knowledge gaps, providing a novel and practical engineering solution for building high - trust, explainable AI diagnostic systems for industrial applications. Keywords: Industrial Fault Diagnosis; Large Language Model (LLM); Hierarchical Cognitive Arbitration; Probabilistic Model; Confidence Calibration; Trustworthy AI 1. Introduction With the deep development of Industry 4.0 and smart manufacturing concepts, modern industrial systems are evolving towards high levels of automation and intelligence. In this process, the reliability and safety of equipment have become key factors determining production efficiency and operational costs. Prognostics and Health Management (PHM), as a core technology, plays an indispensable role in improving equipment reliability, reducing unplanned downtime, and optimizing maintenance costs by monitoring equipment status in real - time, diagnosing potential faults, and predicting remaining useful life [1], [2].