Towards Automating Text Annotation: A Case Study on Semantic Proximity Annotation using GPT-4

Yadav, Sachin, Choppa, Tejaswi, Schlechtweg, Dominik

Jul-4-2024–arXiv.org Artificial Intelligence

This paper explores using GPT-3.5 and GPT-4 to automate the data annotation process with automatic prompting techniques. The main aim of this paper is to reuse human annotation guidelines along with some annotated data to design automatic prompts for LLMs, focusing on the semantic proximity annotation task. Automatic prompts are compared to customized prompts. We further implement the prompting strategies into an open-source text annotation tool, enabling easy online use via the OpenAI API. Our study reveals the crucial role of accurate prompt design and suggests that prompting GPT-4 with human-like instructions is not straightforwardly possible for the semantic proximity task. We show that small modifications to the human guidelines already improve the performance, suggesting possible ways for future research.

computational linguistic, judgment, semanticscholar, (13 more...)

arXiv.org Artificial Intelligence

Jul-4-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.05)
  - United States
    - New Mexico > Santa Fe County
      - Santa Fe (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
- Europe
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Germany > Baden-Württemberg
    - Stuttgart Region > Stuttgart (0.05)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found