Prompting Implicit Discourse Relation Annotation
Yung, Frances, Ahmad, Mansoor, Scholman, Merel, Demberg, Vera
–arXiv.org Artificial Intelligence
Pre-trained large language models, such as ChatGPT, archive outstanding performance in various reasoning tasks without supervised training and were found to have outperformed crowdsourcing workers. Nonetheless, ChatGPT's performance in the task of implicit discourse relation classification, prompted by a standard multiple-choice question, is still far from satisfactory and considerably inferior to state-of-the-art supervised approaches. This work investigates several proven prompting techniques to improve ChatGPT's recognition of discourse relations. In particular, we experimented with breaking down the classification task that involves numerous abstract labels into smaller subtasks. Nonetheless, experiment results show that the inference accuracy hardly changes even with sophisticated prompt engineering, suggesting that implicit discourse relation classification is not yet resolvable under zero-shot or few-shot settings.
arXiv.org Artificial Intelligence
Feb-7-2024
- Country:
- Africa > Middle East
- Libya > Al Wahat District (0.04)
- Morocco (0.04)
- Asia
- India > Telangana
- Hyderabad (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Vietnam > Hanoi
- Hanoi (0.04)
- India > Telangana
- Europe
- Czechia > Prague (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Germany
- Berlin (0.04)
- Saarland > Saarbrücken (0.04)
- Italy > Tuscany
- Florence (0.04)
- Netherlands > Utrecht (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- North America
- Canada
- United States
- Colorado > Denver County
- Denver (0.04)
- Maryland > Baltimore (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Colorado > Denver County
- Africa > Middle East
- Genre:
- Research Report (1.00)
- Technology: