A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data

Mar-2-2025–arXiv.org Artificial Intelligence

ABSTRACT

Chest diseases rank among the most prevalent and dangerous global health issues. Object detection and phrase grounding deep learning models interpret complex radiology data to assist healthcare professionals in diagnosis. Object detection locates abnormalities for classes, while phrase grounding locates abnormalities for textual descriptions. This paper investigates how text enhances abnormality localization in chest X-rays by comparing the performance and explainability of these two tasks. To establish an explainability benchmark, we proposed an automatic pipeline to generate image regions for report sentences using radiologists' eye-tracking data.

Index Terms -- Multi-Modal Learning, Localization, Eye-tracking Data, Data Generation, XAI

1. INTRODUCTION

Since their emergence, deep neural networks (DNNs) have been applied to various medical domains and applications.
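The abstract mentions generating image regions for report sentences from radiologists' eye-tracking data but gives no details of the pipeline. The following is only a minimal sketch of one way such a region could be derived, not the authors' method. It assumes each gaze fixation carries pixel coordinates and start/end timestamps, and that each report sentence has a known dictation time window. The `Fixation` schema, the `region_for_sentence` helper, and the `margin` parameter are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Fixation:
    """One gaze fixation (hypothetical schema, not taken from the paper)."""
    x: float        # pixel column on the chest X-ray
    y: float        # pixel row on the chest X-ray
    t_start: float  # seconds since the reading session began
    t_end: float    # seconds since the reading session began


Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def region_for_sentence(
    fixations: List[Fixation],
    sent_start: float,
    sent_end: float,
    margin: float = 0.0,
) -> Optional[Box]:
    """Bounding box around fixations that overlap a sentence's time window.

    Assumption: gaze during the window in which a sentence is dictated
    points at the image area that the sentence describes.
    Returns None when no fixation overlaps the window.
    """
    # Keep fixations whose time interval overlaps [sent_start, sent_end].
    hits = [
        f for f in fixations
        if f.t_start < sent_end and f.t_end > sent_start
    ]
    if not hits:
        return None

    xs = [f.x for f in hits]
    ys = [f.y for f in hits]
    # Pad the tight box by `margin` pixels on every side.
    return (
        min(xs) - margin,
        min(ys) - margin,
        max(xs) + margin,
        max(ys) + margin,
    )


if __name__ == "__main__":
    gaze = [
        Fixation(100, 200, 0.0, 0.5),
        Fixation(150, 260, 0.6, 1.0),
        Fixation(400, 50, 2.0, 2.4),
    ]
    # Sentence dictated between 0.4 s and 1.2 s of the session.
    print(region_for_sentence(gaze, 0.4, 1.2, margin=10))
```

A box produced this way could then be compared against the box a detection or grounding model predicts for the same sentence. Both the time-window alignment and the tight-box choice are assumptions of this sketch and may differ from the pipeline the paper describes.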

abnormality, artificial intelligence, machine learning

Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre:
- Research Report (0.65)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Nuclear Medicine (1.00)
  - Therapeutic Area (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Vision (1.00)