Analysing zero-shot temporal relation extraction on clinical notes using temporal consistency

Kougia, Vasiliki, Sedova, Anastasiia, Stephan, Andreas, Zaporojets, Klim, Roth, Benjamin

Jun-17-2024–arXiv.org Artificial Intelligence

This paper presents the first study for temporal relation extraction in a zero-shot setting focusing on biomedical text. We employ two types of prompts and five LLMs (GPT-3.5, Mixtral, Llama 2, Gemma, and PMC-LLaMA) to obtain responses about the temporal relations between two events. Our experiments demonstrate that LLMs struggle in the zero-shot setting performing worse than fine-tuned specialized models in terms of F1 score, showing that this is a challenging task for LLMs. We further contribute a novel comprehensive temporal analysis by calculating consistency scores for each LLM. Our findings reveal that LLMs face challenges in providing responses consistent to the temporal properties of uniqueness and transitivity. Figure 1: An example of three event pairs annotated Moreover, we study the relation between the with temporal relations. In the right part, the order of temporal consistency of an LLM and its accuracy the events with respect to time (t) is shown and the and whether the latter can be improved by consistency of uniqueness and transitivity.

consistency, prediction, relation, (14 more...)

arXiv.org Artificial Intelligence

Jun-17-2024

arXiv.org PDF

Add feedback

Country:
- Asia > Singapore (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > Canada
  - Ontario > Toronto (0.04)
  - British Columbia > Metro Vancouver Regional District
    - Vancouver (0.04)
- Europe
  - Austria > Vienna (0.15)
  - Denmark
    - Central Jutland > Aarhus (0.04)
    - Capital Region > Copenhagen (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Health & Medicine > Health Care Technology > Medical Record (0.41)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found