DS@GT at CheckThat! 2025: Ensemble Methods for Detection of Scientific Discourse on Social Media

Parikh, Ayush, Truong, Hoang Thanh Thanh, Schofield, Jeanette, Heil, Maximilian

Jul-9-2025–arXiv.org Artificial Intelligence

In this paper, we, as the DS@GT team for CLEF 2025 CheckThat! Task 4a Scientific Web Discourse Detection, present the methods we explored for this task. For this multiclass classification task, we determined if a tweet contained a scientific claim, a reference to a scientific study or publication, and/or mentions of scientific entities, such as a university or a scientist. We present 3 modeling approaches for this task: transformer finetuning, few-shot prompting of LLMs, and a combined ensemble model whose design was informed by earlier experiments. Our team placed 7th in the competition, achieving a macro-averaged F1 score of 0.8611, an improvement over the DeBERTaV3 baseline of 0.8375. Our code is available on Github at https://github.com/dsgt-arc/checkthat-2025-swd/tree/main/subtask-4a.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

Jul-9-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.47)
- Europe (0.46)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found