RadLing: Towards Efficient Radiology Report Understanding

Ghosh, Rikhiya, Karn, Sanjeev Kumar, Danu, Manuela Daniela, Micu, Larisa, Vunikili, Ramya, Farri, Oladimeji

Jun-4-2023–arXiv.org Artificial Intelligence

Most natural language tasks in the radiology domain use language models pre-trained on biomedical corpus. There are few pretrained language models trained specifically for radiology, and fewer still that have been trained in a low data setting and gone on to produce comparable results in fine-tuning tasks. We present RadLing, a continuously pretrained language model using Electra-small (Clark et al., 2020) architecture, trained using over 500K radiology reports, that can compete with state-of-the-art results for fine tuning tasks in radiology domain. Our main contribution in this paper is knowledge-aware masking which is a taxonomic knowledge-assisted pretraining task that dynamically masks tokens to inject knowledge during pretraining. In addition, we also introduce an knowledge base-aided vocabulary extension to adapt the general tokenization vocabulary to radiology domain.

artificial intelligence, expert system, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jun-4-2023

arXiv.org PDF

Add feedback

Country:
- Europe > Portugal (0.14)
- North America > United States (0.14)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Nuclear Medicine (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Processing (0.48)
  - Representation & Reasoning
    - Expert Systems (0.34)
    - Ontologies (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found