Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models

Chris Samarinas, Pracha Promthaw, Atharva Nijasure, Hansi Zeng, Julian Killingback, Hamed Zamani

Apr-23-2024, arXiv.org Artificial Intelligence

This paper explores SynTOD, a new synthetic data generation approach for developing end-to-end Task-Oriented Dialogue (TOD) systems capable of handling complex tasks such as intent classification, slot filling, conversational question answering, and retrieval-augmented response generation, without relying on crowdsourcing or real-world data. SynTOD uses a state transition graph to define the desired behavior of a TOD system and generates diverse, structured conversations through random walks over that graph and response simulation with large language models (LLMs). In our experiments, graph-guided response simulation leads to significant improvements in intent classification, slot filling, and response relevance compared to naive single-prompt simulated conversations. We also investigate the end-to-end TOD effectiveness of different base and instruction-tuned LLMs, with and without the constructed synthetic conversations. Finally, we explore how various LLMs can evaluate responses in a TOD system and how well their assessments correlate with human judgments. Our findings pave the way for quick development and evaluation of domain-specific TOD systems. We release our datasets, models, and code for research purposes.
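
The graph-guided generation described above can be pictured with a small sketch. The graph, its state names, the transition weights, and the helper functions `random_walk` and `turn_instructions` below are all hypothetical illustrations, not the paper's actual graph or released code. The sketch only shows the general idea: sample a path of dialogue states by a weighted random walk, then turn each state into an instruction that could be handed to an LLM to simulate that turn.

```python
import random

# Hypothetical state transition graph: state -> list of (next_state, weight).
# States and weights are illustrative assumptions, not taken from the paper.
GRAPH = {
    "start": [("search", 1.0)],
    "search": [("show_results", 1.0)],
    "show_results": [("select_item", 0.6), ("search", 0.3), ("end", 0.1)],
    "select_item": [("ask_question", 0.5), ("next_step", 0.4), ("end", 0.1)],
    "ask_question": [("ask_question", 0.2), ("next_step", 0.7), ("end", 0.1)],
    "next_step": [("ask_question", 0.3), ("next_step", 0.4), ("end", 0.3)],
}


def random_walk(graph, start="start", end="end", max_steps=20, rng=None):
    """Sample a sequence of dialogue states by a weighted random walk."""
    if rng is None:
        rng = random.Random()
    path = [start]
    state = start
    # Stop at the terminal state or once the path reaches max_steps states.
    while state != end and len(path) < max_steps:
        next_states, weights = zip(*graph[state])
        state = rng.choices(next_states, weights=weights, k=1)[0]
        path.append(state)
    return path


def turn_instructions(path):
    """Turn each sampled state into a per-turn instruction for an LLM simulator."""
    return [
        f"Turn {i}: write a user utterance expressing the intent '{state}'."
        for i, state in enumerate(path[1:], start=1)
        if state != "end"
    ]


if __name__ == "__main__":
    walk = random_walk(GRAPH, rng=random.Random(0))
    print(" -> ".join(walk))
    for line in turn_instructions(walk):
        print(line)
```

Because every sampled conversation follows an explicit path, the intent label for each turn is known by construction, which is one plausible reason a graph-guided simulation can yield cleaner supervision than a single free-form prompt.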

Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)