Using Voice Transformations to Create Additional Training Talkers for Word Spotting

Chang, Eric I., Lippmann, Richard P.

Neural Information Processing Systems 

Lack of training data has always been a constraint in training speech recognizers. This research presents a voice transformation technique which increases the variety among training talkers. The resulting more varied training set provided up to 2.9 percentage points of improvement in the figure of merit (average detection rate) of a high performance word spotter. This improvement is similar to the increase in performance provided by doubling the amount of training data (Carlson, 1994). This technique can also be applied to other speech recognition systems such as continuous speech recognition, talker identification, and isolated speech recognition.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found