Deep Reinforcement Learning for Multi-Domain Dialogue Systems
Cuayáhuitl, Heriberto, Yu, Seunghak, Williamson, Ashley, Carse, Jacob
–arXiv.org Artificial Intelligence
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems. We propose a method for multi-domain dialogue policy learning---termed NDQN, and apply it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that our proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems.
arXiv.org Artificial Intelligence
Nov-26-2016
- Country:
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Europe
- United Kingdom (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Asia > South Korea
- North America > United States
- Genre:
- Research Report (0.64)
- Industry:
- Consumer Products & Services
- Hotels (0.49)
- Restaurants (0.35)
- Consumer Products & Services
- Technology: