Physiologically Based Speech Synthesis

Hirayama, Makoto, Vatikiotis-Bateson, Eric, Honda, Kiyoshi, Koike, Yasuharu, Kawato, Mitsuo

Neural Information Processing Systems 

This study demonstrates a paradigm for modeling speech production basedon neural networks. Using physiological data from speech utterances, a neural network learns the forward dynamics relating motor commands to muscles and the ensuing articulator behavior that allows articulator trajectories to be generated from motor commands constrained by phoneme input strings and global performance parameters. From these movement trajectories, a second neuralnetwork generates PARCOR parameters that are then used to synthesize the speech acoustics.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found