Discriminative Keyword Selection Using Support Vector Machines
Richardson, Fred, Campbell, William M.
–Neural Information Processing Systems
Many tasks in speech processing involve classification of long term characteristics of a speech segment such as language, speaker, dialect, or topic. A natural technique fordetermining these characteristics is to first convert the input speech into a sequence of tokens such as words, phones, etc. From these tokens, we can then look for distinctive sequences, keywords, that characterize the speech. In many applications, a set of distinctive keywords may not be known a priori. In this case, an automatic method of building up keywords from short context units such as phones is desirable. We propose a method for the construction of keywords based upon Support Vector Machines. We cast the problem of keyword selection as a feature selection problem for n-grams of phones. We propose an alternating filter-wrappermethod that builds successively longer keywords. Application of this method to language recognition and topic recognition tasks shows that the technique produces interesting and significant qualitative and quantitative results.
Neural Information Processing Systems
Dec-31-2008
- Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- Industry:
- Government > Regional Government (0.68)
- Technology: