DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Hengli Li, Song-Chun Zhu, Zilong Zheng
arXiv.org Artificial Intelligence
Pragmatic reasoning plays a pivotal role in deciphering the implicit meanings that frequently arise in real-life conversations and is essential for developing communicative social agents. In this paper, we introduce a novel challenge, DiPlomat, which aims to benchmark machines' capabilities in pragmatic reasoning and situated conversational understanding. Whereas previous works treat different figurative expressions (e.g., metaphor, sarcasm) as individual tasks, DiPlomat provides a cohesive framework for general pragmatic understanding. Our dataset was created using Amazon Mechanical Turk (AMT) and comprises a total of 4,177 multi-turn dialogues. Alongside the dataset, we propose two tasks: Pragmatic Identification and Reasoning (PIR) and Conversational Question Answering (CQA). Experimental results with state-of-the-art (SOTA) neural architectures reveal several significant findings: 1) large language models (LLMs) perform poorly in this subjective domain; 2) comprehensive understanding of context emerges as a critical factor in establishing benign human-machine interactions; 3) current models are deficient in applying pragmatic reasoning. We therefore call for more attention to improving context understanding, reasoning, and the modeling of implied meaning.
Jun-19-2023
- Country:
- Atlantic Ocean (0.04)
- Oceania
- New Zealand (0.04)
- Australia (0.04)
- North America
- Canada (0.04)
- United States
- Colorado (0.04)
- Pennsylvania (0.04)
- Ohio (0.04)
- New York (0.04)
- Michigan (0.04)
- Washington > King County
- Seattle (0.04)
- Florida > Leon County
- Tallahassee (0.04)
- Europe
- United Kingdom > England
- Greater Manchester > Manchester (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Czechia
- South Moravian Region > Brno (0.04)
- Prague (0.04)
- Asia
- Singapore (0.04)
- Middle East
- Republic of Türkiye (0.14)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- China > Beijing
- Beijing (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment (1.00)
- Law (1.00)
- Health & Medicine (1.00)
- Information Technology (0.67)
- Government > Regional Government
- Technology: