Bridging AI and Clinical Practice: Integrating Automated Sleep Scoring Algorithm with Uncertainty-Guided Physician Review
Bechny, Michal, Monachino, Giuliana, Fiorillo, Luigi, van der Meer, Julia, Schmidt, Markus H., Bassetti, Claudio L. A., Tzovara, Athina, Faraci, Francesca D.
–arXiv.org Artificial Intelligence
Purpose: This study aims to enhance the clinical use of automated sleep-scoring algorithms by incorporating an uncertainty estimation approach to efficiently assist clinicians in the manual review of predicted hypnograms, a necessity due to the notable inter-scorer variability inherent in polysomnography (PSG) databases. Our efforts target the extent of review required to achieve predefined agreement levels, examining both in-domain and out-of-domain data, and considering subjects diagnoses. Patients and methods: Total of 19578 PSGs from 13 open-access databases were used to train U-Sleep, a state-of-the-art sleep-scoring algorithm. We leveraged a comprehensive clinical database of additional 8832 PSGs, covering a full spectrum of ages and sleep-disorders, to refine the U-Sleep, and to evaluate different uncertainty-quantification approaches, including our novel confidence network. The ID data consisted of PSGs scored by over 50 physicians, and the two OOD sets comprised recordings each scored by a unique senior physician. Results: U-Sleep demonstrated robust performance, with Cohen's kappa (K) at 76.2% on ID and 73.8-78.8% on OOD data. The confidence network excelled at identifying uncertain predictions, achieving AUROC scores of 85.7% on ID and 82.5-85.6% on OOD data. Independently of sleep-disorder status, statistical evaluations revealed significant differences in confidence scores between aligning vs discording predictions, and significant correlations of confidence scores with classification performance metrics. To achieve K of at least 90% with physician intervention, examining less than 29.0% of uncertain epochs was required, substantially reducing physicians workload, and facilitating near-perfect agreement.
arXiv.org Artificial Intelligence
Dec-22-2023
- Country:
- Europe
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Switzerland > Bern
- Bern (0.04)
- Germany > Bavaria
- North America > United States
- Ohio (0.04)
- Europe
- Genre:
- Research Report
- Experimental Study (0.68)
- New Finding (1.00)
- Research Report
- Industry:
- Health & Medicine > Therapeutic Area
- Neurology (1.00)
- Psychiatry/Psychology (0.69)
- Sleep (0.66)
- Health & Medicine > Therapeutic Area
- Technology: