Reinforcement Learning for Spoken Dialogue Systems
Singh, Satinder P., Kearns, Michael J., Litman, Diane J., Walker, Marilyn A.
–Neural Information Processing Systems
Recently,a number of authorshave proposedtreating dialogue systems as Markov decision processes(MDPs). However,the practicalapplicationofMDP algorithms to dialogue systems faces a numberof severe technicalchallenges.We have built a general software tool (RLDS, for ReinforcementLearning for Dialogue Systems) on the MDP framework, and have applied it to dialogue corpora gatheredbased from two dialoguesystemsbuilt at AT&T Labs. Our experimentsdemonstratethat RLDS holds promise as a tool for "browsing" and understandingcorrelationsin complex, temporallydependentdialogue corpora.
Neural Information Processing Systems
Dec-31-2000