Improving Spoken Dialogue Understanding Using Phonetic Mixture Models

Wang, William Yang (Columbia University) | Artstein, Ron (USC Institute for Creative Technologies) | Leuski, Anton (USC Institute for Creative Technologies) | Traum, David (USC Institute for Creative Technologies)

May-18-2011–AAAI Conferences

Augmenting word tokens with a phonetic representation, derived from a dictionary, improves the performance of a Natural Language Understanding component that interprets speech recognizer output: we observed a 5% to 7% reduction in errors across a wide range of response return rates. The best performance comes from mixture models incorporating both word and phone features. Since the phonetic representation is derived from a dictionary, the method can be applied easily without the need for integration with a specific speech recognizer. The method has similarities with autonomous (or bottom-up) psychological models of lexical access, where contextual information is not integrated at the stage of auditory perception but rather later.

language model, tokenizer, utterance, (15 more...)

AAAI Conferences

May-18-2011

Conferences PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - New York > Monroe County
    - Rochester (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - San Diego County > Vista (0.04)
- Europe
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)

Genre:
- Research Report (0.46)

Industry:
- Government (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (1.00)
  - Speech > Speech Recognition (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found