Goto

Collaborating Authors

 Waibel, Alex


Multilingual Adaptation of RNN Based ASR Systems

arXiv.org Artificial Intelligence

In this work, we focus on multilingual systems based on recurrent neural networks (RNNs), trained using the Connectionist Temporal Classification (CTC) loss function. Using a multilingual set of acoustic units poses difficulties. To address this issue, we proposed Language Feature Vectors (LFVs) to train language-adaptive multilingual systems. Language adaptation, in contrast to speaker adaptation, needs to be applied not only at the feature level, but also to deeper layers of the network. In this work, we therefore extended our previous approach by introducing a novel technique which we call "modulation". Based on this method, we modulated the hidden layers of RNNs using LFVs. We evaluated this approach in both full and low-resource conditions, as well as for grapheme- and phone-based systems. Using modulation lowered error rates across all conditions.
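As a rough sketch of what such modulation could look like, the example below scales the hidden activations of a bidirectional LSTM with a gate computed from a language feature vector. The module name, dimensions, and sigmoid gating are illustrative assumptions, not the paper's exact formulation.

```python
# Hypothetical sketch: modulating RNN hidden layers with a Language
# Feature Vector (LFV). The sigmoid gate and all dimensions are
# illustrative assumptions.
import torch
import torch.nn as nn

class LFVModulatedRNN(nn.Module):
    def __init__(self, input_dim, hidden_dim, lfv_dim, num_classes):
        super().__init__()
        self.rnn = nn.LSTM(input_dim, hidden_dim,
                           batch_first=True, bidirectional=True)
        # Projects the LFV to one multiplicative weight per hidden unit.
        self.modulator = nn.Linear(lfv_dim, 2 * hidden_dim)
        self.out = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, feats, lfv):
        # feats: (batch, time, input_dim); lfv: (batch, lfv_dim)
        h, _ = self.rnn(feats)
        gate = torch.sigmoid(self.modulator(lfv)).unsqueeze(1)
        h = h * gate        # modulate every time step of the hidden layer
        return self.out(h)  # per-frame logits, to be trained with CTC

model = LFVModulatedRNN(input_dim=40, hidden_dim=320, lfv_dim=32,
                        num_classes=50)
logits = model(torch.randn(4, 100, 40), torch.randn(4, 32))
```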


Adaptively Growing Hierarchical Mixtures of Experts

Neural Information Processing Systems

We propose a novel approach to automatically growing and pruning Hierarchical Mixtures of Experts. The constructive algorithm proposed here enables large hierarchies consisting of several hundred experts to be trained effectively. We show that HMEs trained by our automatic growing procedure yield better generalization performance than traditional static and balanced hierarchies. Evaluation of the algorithm is performed (1) on vowel classification and (2) within a hybrid version of the JANUS [9] speech recognition system using a subset of the Switchboard large-vocabulary speaker-independent continuous speech recognition database.
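The growing step can be pictured with a toy mixture of experts: train, find the expert that contributes the most error, and split it. The sketch below uses a flat (non-hierarchical) mixture and an accumulated-loss split criterion purely for illustration; the paper's actual growing and pruning rules for tree-structured HMEs may differ.

```python
# Toy sketch of "growing" a mixture of experts by splitting the expert
# with the highest accumulated loss. The flat mixture and the split
# criterion are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

class MixtureOfExperts:
    def __init__(self, in_dim, out_dim, n_experts=2):
        self.experts = [rng.normal(0, 0.1, (in_dim, out_dim))
                        for _ in range(n_experts)]
        self.gate = rng.normal(0, 0.1, (in_dim, n_experts))

    def forward(self, X):
        g = softmax(X @ self.gate)                      # (N, n_experts)
        y = np.stack([X @ W for W in self.experts], 1)  # (N, n_experts, out)
        return (g[..., None] * y).sum(axis=1), g

    def grow(self, expert_losses):
        # Split the worst expert into two slightly perturbed copies
        # and widen the gate with a matching new column.
        k = int(np.argmax(expert_losses))
        W = self.experts[k]
        self.experts[k] = W + rng.normal(0, 0.01, W.shape)
        self.experts.append(W - rng.normal(0, 0.01, W.shape))
        new_col = self.gate[:, k:k + 1] + rng.normal(0, 0.01,
                                                     (self.gate.shape[0], 1))
        self.gate = np.hstack([self.gate, new_col])

moe = MixtureOfExperts(in_dim=8, out_dim=3)
out, gates = moe.forward(rng.normal(size=(16, 8)))
moe.grow(expert_losses=[0.9, 0.4])  # expert 0 is split into two
```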


The Use of Dynamic Writing Information in a Connectionist On-Line Cursive Handwriting Recognition System

Neural Information Processing Systems

This system combines a robust input representation, which preserves the dynamic writing information, with a neural network architecture, a so-called Multi-State Time Delay Neural Network (MS-TDNN), which integrates recognition and segmentation in a single framework. Our preprocessing transforms the original coordinate sequence into a (still temporal) sequence of feature vectors, which combine strictly local features, like curvature or writing direction, with a bitmap-like representation of the coordinate's proximity. The MS-TDNN architecture is well suited for handling temporal sequences as provided by this input representation. Our system is tested on both writer-dependent and writer-independent tasks with vocabulary sizes ranging from 400 up to 20,000 words. For example, on a 20,000-word vocabulary we achieve word recognition rates of up to 88.9% (writer-dependent) and 84.1% (writer-independent) without using any language models.
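To make the "strictly local features" concrete, the sketch below computes per-point writing direction and curvature from an (x, y) pen trajectory. Encoding direction as (cos, sin) of the stroke angle and curvature as the angle change between consecutive segments is a common convention and an assumption here, not necessarily the paper's exact definition.

```python
# Illustrative local features for on-line handwriting: writing direction
# and curvature from a pen trajectory. Feature definitions are assumptions.
import numpy as np

def local_features(coords):
    """coords: (T, 2) array of pen positions sampled over time."""
    d = np.diff(coords, axis=0)            # segment vectors, (T-1, 2)
    angles = np.arctan2(d[:, 1], d[:, 0])  # writing direction per segment
    direction = np.stack([np.cos(angles), np.sin(angles)], axis=1)
    curvature = np.diff(angles)            # angle change between segments
    # Wrap angle differences into [-pi, pi).
    curvature = (curvature + np.pi) % (2 * np.pi) - np.pi
    return direction, curvature

pen = np.cumsum(np.random.default_rng(1).normal(size=(50, 2)), axis=0)
direction, curvature = local_features(pen)
```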


Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition

Neural Information Processing Systems

Connectionist speech recognition systems are often handicapped by an inconsistency between training and testing criteria. This problem is addressed by the Multi-State Time Delay Neural Network (MS-TDNN), a hierarchical phoneme and word classifier which uses DTW to modulate its connectivity.
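The DTW component can be illustrated with a standard alignment of frame-level state scores to a word model: at each frame the path either stays in the current state or advances to the next one. This is generic DTW under those assumptions, not the paper's exact recipe.

```python
# Generic DTW sketch: align T frames of per-state scores to S word
# states with stay-or-advance transitions. Assumes T >= S.
import numpy as np

def dtw_align(frame_scores):
    """frame_scores: (T, S) log-score of each word state at each frame."""
    T, S = frame_scores.shape
    acc = np.full((T, S), -np.inf)
    acc[0, 0] = frame_scores[0, 0]
    for t in range(1, T):
        for s in range(S):
            best_prev = acc[t - 1, s]                          # stay
            if s > 0:
                best_prev = max(best_prev, acc[t - 1, s - 1])  # advance
            acc[t, s] = frame_scores[t, s] + best_prev
    path = [S - 1]                                             # backtrack
    for t in range(T - 1, 0, -1):
        s = path[-1]
        if s > 0 and acc[t - 1, s - 1] >= acc[t - 1, s]:
            path.append(s - 1)
        else:
            path.append(s)
    return path[::-1], acc[-1, -1]

scores = np.random.default_rng(2).normal(size=(20, 5))
path, total = dtw_align(scores)
```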


