PLACES: Prompting Language Models for Social Conversation Synthesis
Chen, Maximillian, Papangelis, Alexandros, Tao, Chenyang, Kim, Seokhwan, Rosenbaum, Andy, Liu, Yang, Yu, Zhou, Hakkani-Tur, Dilek
–arXiv.org Artificial Intelligence
Collecting high quality conversational data can be very expensive for most applications and infeasible for others due to privacy, ethical, or similar concerns. A promising direction to tackle this problem is to generate synthetic dialogues by prompting large language models. In this work, we use a small set of expert-written conversations as in-context examples to synthesize a social conversation dataset using prompting. We perform several thorough evaluations of our synthetic conversations compared to human-collected conversations. This includes various dimensions of conversation quality with human evaluation directly on the synthesized conversations, and interactive human evaluation of chatbots fine-tuned on the synthetically generated dataset. We additionally demonstrate that this prompting approach is generalizable to multi-party conversations, providing potential to create new synthetic data for multi-party tasks. Our synthetic multi-party conversations were rated more favorably across all measured dimensions compared to conversation excerpts sampled from a human-collected multi-party dataset.
arXiv.org Artificial Intelligence
Feb-16-2023
- Country:
- Asia (1.00)
- Europe (1.00)
- North America > United States (1.00)
- Genre:
- Personal > Interview (1.00)
- Research Report (1.00)
- Industry:
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
- Beverages (0.68)
- Education (1.00)
- Government > Regional Government (0.68)
- Health & Medicine (0.93)
- Leisure & Entertainment > Sports (0.93)
- Media
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
- Technology: