Towards Automating Text Annotation: A Case Study on Semantic Proximity Annotation using GPT-4
Yadav, Sachin, Choppa, Tejaswi, Schlechtweg, Dominik
arXiv.org Artificial Intelligence
This paper explores using GPT-3.5 and GPT-4 to automate the data annotation process with automatic prompting techniques. The main aim of this paper is to reuse human annotation guidelines along with some annotated data to design automatic prompts for LLMs, focusing on the semantic proximity annotation task. Automatic prompts are compared to customized prompts. We further implement the prompting strategies into an open-source text annotation tool, enabling easy online use via the OpenAI API. Our study reveals the crucial role of accurate prompt design and suggests that prompting GPT-4 with human-like instructions is not straightforwardly possible for the semantic proximity task. We show that small modifications to the human guidelines already improve the performance, suggesting possible ways for future research.
Jul-4-2024
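As a rough illustration of the idea in the abstract (reusing human annotation guidelines plus a few annotated instances to build a prompt), the sketch below assembles a few-shot prompt as plain text and parses a numeric judgment from a model reply. It is not the authors' implementation: the names `Example`, `build_prompt`, and `parse_judgment`, the prompt layout, and the default label set `(1, 2, 3, 4)` are all assumptions made for this example, and no API call is made.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Example:
    """One annotated instance: a target word in two contexts plus a human label.

    The field names are hypothetical and chosen only for this sketch.
    """
    word: str
    sentence_a: str
    sentence_b: str
    label: int


def format_pair(word: str, sentence_a: str, sentence_b: str) -> str:
    # Render one usage pair in a fixed layout (assumed, not taken from the paper).
    return (
        f"Target word: {word}\n"
        f"Sentence 1: {sentence_a}\n"
        f"Sentence 2: {sentence_b}"
    )


def build_prompt(
    guidelines: str,
    examples: Sequence[Example],
    word: str,
    sentence_a: str,
    sentence_b: str,
) -> str:
    """Concatenate the guidelines, the labeled examples, and the unlabeled query."""
    parts = [guidelines.strip(), ""]
    for ex in examples:
        parts.append(format_pair(ex.word, ex.sentence_a, ex.sentence_b))
        parts.append(f"Judgment: {ex.label}")
        parts.append("")
    parts.append(format_pair(word, sentence_a, sentence_b))
    parts.append("Judgment:")  # left open for the model to complete
    return "\n".join(parts)


def parse_judgment(reply: str, valid: Sequence[int] = (1, 2, 3, 4)) -> Optional[int]:
    """Return the first valid integer label found in the reply, or None."""
    for token in reply.split():
        stripped = token.strip(".,:;()")
        if stripped.isdigit() and int(stripped) in valid:
            return int(stripped)
    return None
```

The resulting string could then be sent as the user message of a chat request; returning `None` for unparseable replies lets the caller decide whether to retry or discard the instance.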
- Country:
- North America
- Dominican Republic (0.05)
- United States
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Germany > Baden-Württemberg
- Stuttgart Region > Stuttgart (0.05)
- Asia > Middle East
- UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Genre:
- Research Report > New Finding (0.46)
- Technology: