FZI-WIM at SemEval-2024 Task 2: Self-Consistent CoT for Complex NLI in Biomedical Domain

Jun-14-2024–arXiv.org Artificial Intelligence

This paper describes the inference system of FZI-WIM at the SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials. Our system utilizes the chain of thought (CoT) paradigm to tackle this complex reasoning problem and further improves the CoT performance with self-consistency. Instead of greedy decoding, we sample multiple reasoning chains with the same prompt and make the final verification with majority voting. The self-consistent CoT system achieves a baseline F1 score of 0.80 (1st), faithfulness score of 0.90 (3rd), and consistency score of 0.73 (12th). We release the code and data publicly https://github.com/jens5588/FZI-WIM-NLI4CT.

clinical trial report, participant, trial report, (16 more...)

arXiv.org Artificial Intelligence

Jun-14-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Minnesota
    - Hennepin County > Minneapolis (0.14)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Romania > Sud - Muntenia Development Region
    - Giurgiu County > Giurgiu (0.04)
  - Germany > Baden-Württemberg
    - Karlsruhe Region > Karlsruhe (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (1.00)
  - Therapeutic Area
    - Oncology (0.50)
    - Gastroenterology (0.49)
    - Cardiology/Vascular Diseases (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.72)
  - Machine Learning > Neural Networks (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found