DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax Training

Bhuvanesh Verma, Lisa Raithel

arXiv.org Artificial Intelligence 

Natural language processing (NLP) has seen significant advancements, beginning with the introduction of word embeddings (Mikolov et al., 2013), followed by transformer architectures like BERT (Vaswani et al., 2017; Devlin et al., 2019), and specialized language models (LMs) such as BioBERT (Lee et al., 2020) and PubMedBERT (Gu et al., 2021) tailored for the biomedical domain. The advent of large language models (LLMs) like GPT-3 (Brown et al., 2020), commonly known as ChatGPT, has further pushed the boundaries of NLP, showcasing capabilities in diverse NLP tasks and even reasoning.

Building on the methodology outlined by Kanakarajan and Sankarasubbu (2023), we assessed the zero-shot performance of various instruction-tuned LLMs to identify the most effective model. Upon selecting the best LLM, we introduced an auxiliary module during the fine-tuning process, which emphasized learning "hard" examples. Taking inspiration from Korakakis and Vlachos (2023), who experimented with various configurations for the auxiliary module and highlighted its substantial impact on the final NLI system's performance, we explored various architectures for this auxiliary module.
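To illustrate the idea of an auxiliary module that emphasizes "hard" examples via minimax training, here is a minimal PyTorch sketch in the spirit of Korakakis and Vlachos (2023). It is an assumption-laden toy, not the paper's implementation: the names (AuxWeighter), the toy classifier standing in for the fine-tuned LLM, and the optimization details are all illustrative.

    import torch
    import torch.nn as nn

    class AuxWeighter(nn.Module):
        """Auxiliary module: scores each example and turns the scores into
        batch-level weights, so high-loss ('hard') examples can be upweighted.
        Hypothetical name/architecture, not taken from the paper."""
        def __init__(self, hidden_dim: int):
            super().__init__()
            self.scorer = nn.Linear(hidden_dim, 1)

        def forward(self, feats: torch.Tensor) -> torch.Tensor:
            # Softmax over the batch yields weights that sum to 1.
            return torch.softmax(self.scorer(feats).squeeze(-1), dim=0)

    # Toy main model standing in for a fine-tuned LLM classifier.
    model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2))
    weighter = AuxWeighter(hidden_dim=16)
    opt_min = torch.optim.AdamW(model.parameters(), lr=1e-3)
    opt_max = torch.optim.AdamW(weighter.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss(reduction="none")  # per-example losses

    x = torch.randn(8, 16)            # batch of example features
    y = torch.randint(0, 2, (8,))     # e.g., entailment / contradiction labels

    # Inner max step: the weighter maximizes the weighted loss, i.e. it
    # learns to place more mass on examples the current model finds hard.
    per_ex = loss_fn(model(x), y)
    weights = weighter(x)
    (-(weights * per_ex.detach()).sum()).backward()
    opt_max.step(); opt_max.zero_grad()

    # Outer min step: the main model minimizes the re-weighted loss.
    per_ex = loss_fn(model(x), y)
    with torch.no_grad():
        weights = weighter(x)
    (weights * per_ex).sum().backward()
    opt_min.step(); opt_min.zero_grad()

The two alternating updates form the minimax game: the auxiliary weighter pushes the weighted loss up by concentrating on hard examples, while the main model pushes it back down, which encourages robustness on exactly those examples.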
