I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents
Prabhumoye, Shrimai, Li, Margaret, Urbanek, Jack, Dinan, Emily, Kiela, Douwe, Weston, Jason, Szlam, Arthur
–arXiv.org Artificial Intelligence
Dialogue research tends to distinguish between chit-chat and goal-oriented tasks. While the former is arguably more naturalistic and has a wider use of language, the latter has clearer metrics and a straightforward learning signal. Humans effortlessly combine the two, for example engaging in chit-chat with the goal of exchanging information or eliciting a specific response. Here, we bridge the divide between these two domains in the setting of a rich multi-player text-based fantasy environment where agents and humans engage in both actions and dialogue. Specifically, we train a goal-oriented model with reinforcement learning against an imitation-learned ``chit-chat'' model with two approaches: the policy either learns to pick a topic or learns to pick an utterance given the top-K utterances from the chit-chat model. We show that both models outperform an inverse model baseline and can converse naturally with their dialogue partner in order to achieve goals.
arXiv.org Artificial Intelligence
Feb-10-2020
- Country:
- Europe > Germany
- Saarland > Saarbrücken (0.04)
- North America > United States
- Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Europe > Germany
- Genre:
- Personal > Interview (0.46)
- Research Report (0.64)
- Industry:
- Leisure & Entertainment
- Gambling (0.86)
- Games > Computer Games (1.00)
- Sports (0.86)
- Leisure & Entertainment
- Technology: