Deep Learning Systems for Pneumothorax Detection on Chest Radiographs: A Multicenter External Validation Study

#artificialintelligence 

To assess the generalizability of a deep learning pneumothorax detection model on datasets from multiple external institutions and examine patient and acquisition factors that might influence performance. In this retrospective study, a deep learning model was trained for pneumothorax detection by merging two large open-source chest radiograph datasets: ChestX-ray14 and CheXpert. It was then tested on six external datasets from multiple independent institutions (labeled A–F) in a retrospective case-control design (data acquired between 2016 and 2019 from institutions A–E; institution F consisted of data from the MIMIC–CXR dataset). Performance on each dataset was evaluated by using area under the receiver operating characteristic curve (AUC) analysis, sensitivity, specificity, and positive and negative predictive values, with two radiologists in consensus being used as the reference standard. Patient and acquisition factors that influenced performance were analyzed.