Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models
Samarinas, Chris, Promthaw, Pracha, Nijasure, Atharva, Zeng, Hansi, Killingback, Julian, Zamani, Hamed
–arXiv.org Artificial Intelligence
This paper explores SynTOD, a new synthetic data generation approach for developing end-to-end Task-Oriented Dialogue (TOD) Systems capable of handling complex tasks such as intent classification, slot filling, conversational question-answering, and retrieval-augmented response generation, without relying on crowdsourcing or real-world data. SynTOD utilizes a state transition graph to define the desired behavior of a TOD system and generates diverse, structured conversations through random walks and response simulation using large language models (LLMs). In our experiments, using graph-guided response simulations leads to significant improvements in intent classification, slot filling and response relevance compared to naive single-prompt simulated conversations. We also investigate the end-to-end TOD effectiveness of different base and instruction-tuned LLMs, with and without the constructed synthetic conversations. Finally, we explore how various LLMs can evaluate responses in a TOD system and how well they are correlated with human judgments. Our findings pave the path towards quick development and evaluation of domain-specific TOD systems. We release our datasets, models, and code for research purposes.
arXiv.org Artificial Intelligence
Apr-23-2024
- Country:
- North America
- Dominican Republic (0.04)
- United States
- New York > New York County
- New York City (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York > New York County
- Europe
- France (0.04)
- Czechia > Prague (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- South Korea (0.14)
- Singapore (0.04)
- Indonesia > Bali (0.04)
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- India > Karnataka
- Bengaluru (0.04)
- North America
- Genre:
- Research Report > New Finding (1.00)
- Technology: