Reinforcement Learning for Spoken Dialogue Systems
Singh, Satinder P., Kearns, Michael J., Litman, Diane J., Walker, Marilyn A.
–Neural Information Processing Systems
Recently, a number of authors have proposed treating dialogue systems as Markov decision processes (MDPs). However, the practical application ofMDP algorithms to dialogue systems faces a number of severe technical challenges. We have built a general software tool (RLDS, for Reinforcement Learning for Dialogue Systems) based on the MDP framework, and have applied it to dialogue corpora gathered from two dialogue systems built at AT&T Labs. Our experiments demonstrate that RLDS holds promise as a tool for "browsing" and understanding correlations in complex, temporally dependent dialogue corpora.
Neural Information Processing Systems
Dec-31-2000