Connectionist Architectures for Multi-Speaker Phoneme Recognition

Dec-31-1990–Neural Information Processing Systems

We present a number of Time-Delay Neural Network (TDNN) based architectures for multi-speaker phoneme recognition (/b,d,g/ task). We use speech of two females and four males to compare the performance of the various architectures against a baseline recognition rate of 95.9% for a single TDNN on the six-speaker /b,d,g/ task. This series of modular designs leads to a highly modular multi-network architecture capable of performing the six-speaker recognition task at the speaker-dependent rate of 98.4%. In addition to its high recognition rate, the so-called "Meta-Pi" architecture learns, without direct supervision, to recognize the speech of one particular male speaker using internal models of other male speakers exclusively.
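The abstract does not spell out how the multi-network architecture merges its modules. As a rough illustration only, the sketch below assumes a softmax-gated weighted sum over speaker-specific module outputs, which is one common way to build such a combination. The function names, the example scores, and the gating rule itself are hypothetical and are not taken from the abstract.

```python
import math

PHONEMES = ["b", "d", "g"]  # the three classes of the /b,d,g/ task


def softmax(scores):
    """Normalize raw gate activations into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine_modules(module_outputs, gate_scores):
    """Weighted sum of per-module class scores (assumed gating rule).

    module_outputs: one list of class scores per speaker-specific module.
    gate_scores: one raw gating activation per module.
    Returns the combined class scores and the gating weights.
    """
    if len(module_outputs) != len(gate_scores):
        raise ValueError("need exactly one gate score per module")
    weights = softmax(gate_scores)
    n_classes = len(module_outputs[0])
    combined = [
        sum(w * out[c] for w, out in zip(weights, module_outputs))
        for c in range(n_classes)
    ]
    return combined, weights


def classify(module_outputs, gate_scores):
    """Return the phoneme label with the highest combined score."""
    combined, _ = combine_modules(module_outputs, gate_scores)
    best = max(range(len(combined)), key=lambda c: combined[c])
    return PHONEMES[best]


if __name__ == "__main__":
    # Two hypothetical modules that disagree on the same input.
    outputs = [[0.9, 0.05, 0.05], [0.2, 0.7, 0.1]]
    print(classify(outputs, [0.0, 0.0]))    # equal trust in both modules
    print(classify(outputs, [-50.0, 50.0]))  # gate trusts the second module
```

Under this assumption, a speaker with no dedicated module could still be recognized if the gate learns to mix the modules trained on other speakers, which is the kind of behavior the abstract reports for one male speaker.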

architecture, connectionist architecture, neural network

Country:
- North America > United States
  - District of Columbia > Washington (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
