Improving Spoken Dialogue Understanding Using Phonetic Mixture Models

Wang, William Yang (Columbia University) | Artstein, Ron (USC Institute for Creative Technologies) | Leuski, Anton (USC Institute for Creative Technologies) | Traum, David (USC Institute for Creative Technologies)

AAAI Conferences 

Augmenting word tokens with a phonetic representation, derived from a dictionary, improves the performance of a Natural Language Understanding component that interprets speech recognizer output: we observed a 5% to 7% reduction in errors across a wide range of response return rates. The best performance comes from mixture models incorporating both word and phone features. Since the phonetic representation is derived from a dictionary, the method can be applied easily without the need for integration with a specific speech recognizer. The method has similarities with autonomous (or bottom-up) psychological models of lexical access, where contextual information is not integrated at the stage of auditory perception but rather later.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found