AITopics

The English alphabet is difficult to recognize automatically because many letters sound alike; e.g., B/D, P/T, V/Z and F/S. When spoken over the telephone, the information needed to discriminate among several of these pairs, such as F/S, P/T, B/D and V/Z, is further reduced due to the limited bandwidth of the channel. Speaker-independent recognition of spelled names over the telephone is difficult due to variability caused by channel distortions, different handsets, and a variety of background noises. Finally, when dealing with a large population of speakers, dialect and foreign accents alter letter pronunciations. An R from a Boston speaker may not contain an [r]. Human classification performance on telephone speech underscores the difficulty of the problem.

category, classification, segmentation

Country:

North America > United States > Oregon > Washington County > Beaverton (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

Hirayama, Makoto, Vatikiotis-Bateson, Eric, Kawato, Mitsuo, Jordan, Michael I.

Forward Dynamics Modeling of Speech Motor Control Using Physiological Data

We propose a paradigm for modeling speech production based on neural networks. We focus on characteristics of the musculoskeletal system. Using real physiological data - articulator movements and EMG from muscle activity - a neural network learns the forward dynamics relating motor commands to muscles and the ensuing articulator behavior. After learning, simulated perturbations were used to assess properties of the acquired model, such as natural frequency, damping, and interarticulator couplings. Finally, a cascade neural network is used to generate continuous motor commands from a sequence of discrete articulatory targets.

forward dynamic modeling, motor command, neural network

Country:

Asia > Middle East > Jordan (0.18)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > California (0.05)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)

Industry: Health & Medicine > Therapeutic Area > Musculoskeletal (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques

Waibel, Alex, Jain, Ajay N., McNair, Arthur E., Tebelskis, Joe, Osterholtz, Louise, Saito, Hiroaki, Schmidbauer, Otto, Sloboda, Tilo, Woszczyna, Monika

JANUS translates continuously spoken English and German into German, English, and Japanese. JANUS currently achieves 87% translation fidelity from English speech and 97% from German speech. We present the JANUS system along with comparative evaluations of its interchangeable processing components, with special emphasis on the connectionist modules.

grammar, janus, translation

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.06)
North America > United States > California > San Mateo County > San Mateo (0.05)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Bengio, Yoshua, Mori, Renato De, Flammia, Giovanni, Kompe, Ralf

Neural Network - Gaussian Mixture Hybrid for Speech Recognition or Density Estimation

The subject of this paper is the integration of multi-layered Artificial Neural Networks (ANN) with probability density functions such as Gaussian mixtures found in continuous density Hidden Markov Models (HMM). In the first part of this paper we present an ANN/HMM hybrid in which all the parameters of the system are simultaneously optimized with respect to a single criterion. In the second part of this paper, we study the relationship between the density of the inputs of the network and the density of the outputs of the network. A few experiments are presented to explore how to perform density estimation with ANNs. This paper studies the integration of Artificial Neural Networks (ANN) with probability density functions (pdf) such as the Gaussian mixtures often used in continuous density Hidden Markov Models. The ANNs considered here are multi-layered or recurrent networks with hyperbolic tangent hidden units.
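As a rough illustration of the two ingredients named in this abstract, the sketch below passes an input through a layer of hyperbolic tangent units and scores the result under a diagonal-covariance Gaussian mixture. It is a minimal sketch under stated assumptions, not the paper's method: the function names `tanh_layer` and `gmm_log_density`, the diagonal covariances, and all parameter shapes are hypothetical, and joint training of the two parts is not shown.

```python
import math

def tanh_layer(x, weights, biases):
    """One layer of hyperbolic tangent units (hypothetical helper)."""
    return [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, biases)
    ]

def gmm_log_density(y, mix_weights, means, variances):
    """Log density of y under a diagonal-covariance Gaussian mixture.

    Assumption: each component k has prior mix_weights[k], mean vector
    means[k], and per-dimension variances variances[k].
    """
    log_terms = []
    for pi_k, mu, var in zip(mix_weights, means, variances):
        log_gauss = sum(
            -0.5 * (math.log(2.0 * math.pi * v) + (yi - m) ** 2 / v)
            for yi, m, v in zip(y, mu, var)
        )
        log_terms.append(math.log(pi_k) + log_gauss)
    # Log-sum-exp over components for numerical stability.
    top = max(log_terms)
    return top + math.log(sum(math.exp(t - top) for t in log_terms))
```

In a hybrid of this kind, the network output `tanh_layer(x, W, b)` would be the observation handed to `gmm_log_density`, so one scalar criterion depends on both the network weights and the mixture parameters.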

experiment, gaussian mixture, likelihood

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)

Genre: Research Report (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Renals, Steve, Morgan, Nelson, Bourlard, Hervé, Franco, Horacio, Cohen, Michael

Connectionist Optimisation of Tied Mixture Hidden Markov Models

Issues relating to the estimation of hidden Markov model (HMM) local probabilities are discussed. In particular we note the isomorphism of radial basis function (RBF) networks to tied mixture density modelling; additionally we highlight the differences between these methods arising from the different training criteria employed. We present a method in which connectionist training can be modified to resolve these differences and discuss some preliminary experiments. Finally, we discuss some outstanding problems with discriminative training.

connectionist optimisation, probability, speech recognition

Country:

North America > United States > California > San Mateo County > San Mateo (0.05)
North America > Canada > Ontario > Toronto (0.05)
North America > United States > California > San Mateo County > Menlo Park (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Singer, Elliot, Lippmann, Richard P.

Improved Hidden Markov Model Speech Recognition Using Radial Basis Function Networks

The RBF network consists of an input layer, a hidden layer composed of Gaussian basis functions, and an output layer. Connections from the input layer to the hidden layer are fixed at unity while those from the hidden layer to the output layer are trained by minimizing the overall mean-square error between actual and desired output values. Each RBF output node has a corresponding state in a set of HMM word models which represent the words in the vocabulary. HMM word models are left-to-right with no skip states and have a one-state background noise model at either end. The background noise models are identical for all words.
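The network structure described above can be sketched in a few lines: Gaussian basis functions in the hidden layer, and hidden-to-output weights fitted by reducing the squared error between actual and desired outputs. This is a minimal sketch under stated assumptions, not the authors' implementation: the names `gaussian_layer`, `rbf_forward`, `train_output_weights`, and `mean_square_error` are hypothetical, a single shared basis width is assumed, the centers are taken as given, and plain per-sample gradient descent stands in for whatever optimizer the paper used.

```python
import math

def gaussian_layer(x, centers, width):
    """Hidden activations: one Gaussian basis function per center."""
    return [
        math.exp(-sum((xi - ci) ** 2 for xi, ci in zip(x, c)) / (2.0 * width ** 2))
        for c in centers
    ]

def rbf_forward(x, centers, width, weights):
    """Output nodes: weighted sums of the hidden activations."""
    h = gaussian_layer(x, centers, width)
    return [sum(w * hj for w, hj in zip(row, h)) for row in weights]

def train_output_weights(data, centers, width, n_out, lr=0.5, epochs=200):
    """Fit only the hidden-to-output weights by gradient descent on squared error."""
    weights = [[0.0] * len(centers) for _ in range(n_out)]
    for _ in range(epochs):
        for x, target in data:
            h = gaussian_layer(x, centers, width)
            y = [sum(w * hj for w, hj in zip(row, h)) for row in weights]
            for k in range(n_out):
                err = target[k] - y[k]
                for j in range(len(centers)):
                    weights[k][j] += lr * err * h[j]
    return weights

def mean_square_error(data, centers, width, weights):
    """Mean-square error between actual and desired outputs over a data set."""
    total, count = 0.0, 0
    for x, target in data:
        y = rbf_forward(x, centers, width, weights)
        total += sum((t - yk) ** 2 for t, yk in zip(target, y))
        count += len(target)
    return total / count
```

In the hybrid recognizer the outputs of `rbf_forward` would feed the HMM word-model states, one output node per state; that coupling is not shown here.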

hybrid recognizer, rbf network, recognizer

Country:

North America > United States > California > San Mateo County > San Mateo (0.05)
North America > United States > Oregon (0.04)
North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Levin, Esther, Pieraccini, Roberto, Bocchieri, Enrico

Time-Warping Network: A Hybrid Framework for Speech Recognition

Hybrid systems attempt to combine the best features of both models: the temporal structure of HMMs and the discriminative power of neural networks. In this work we define a time-warping (TW) neuron that extends the operation of the formal neuron of a back-propagation network by warping the input pattern to match it optimally to its weights. We show that a single-layer network of TW neurons is equivalent to a Gaussian density HMM-based recognition system.

neuron, recognizer, time-warping network

Country:

North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > Canada > Ontario > Toronto (0.04)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.33)

Príncipe, José Carlos, Vries, Bert de, Kuo, Jyh-Ming, Oliveira, Pedro Guedes de

Modeling Applications with the Focused Gamma Net

The focused gamma network is proposed as one of the possible implementations of the gamma neural model. The focused gamma network is compared with the focused backpropagation network and TDNN for a time series prediction problem, and with ADALINE in a system identification problem.
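The abstract does not spell out the memory structure, so the following is only a sketch under an assumption: it uses the commonly cited gamma memory recursion x_k(t) = (1 - mu) x_k(t-1) + mu x_{k-1}(t-1), with x_0(t) equal to the input. The function name `gamma_memory` and its parameters are hypothetical. In a focused arrangement, the taps produced here would be the inputs to an ordinary feedforward network, which is not shown.

```python
def gamma_memory(signal, order, mu):
    """Return the memory taps [x_0(t), ..., x_K(t)] for every time step.

    Assumed recursion: x_k(t) = (1 - mu) * x_k(t-1) + mu * x_{k-1}(t-1),
    with x_0(t) = u(t) and all taps initialised to zero.
    """
    taps = [0.0] * (order + 1)
    history = []
    for u in signal:
        prev = taps
        taps = [u] + [
            (1.0 - mu) * prev[k] + mu * prev[k - 1]
            for k in range(1, order + 1)
        ]
        history.append(taps)
    return history
```

With `mu = 1.0` the recursion reduces to a tapped delay line, the kind of input memory a TDNN uses; smaller values of `mu` trade temporal resolution for a longer memory depth.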

gamma memory, gamma network, information

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > Florida > Alachua County > Gainesville (0.04)
North America > United States > California > San Mateo County > San Mateo (0.04)
Europe > Portugal > Aveiro > Aveiro (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Haffner, Patrick, Waibel, Alex

Multi-State Time Delay Networks for Continuous Speech Recognition

We present the "Multi-State Time Delay Neural Network" (MS-TDNN) as an extension of the TDNN to robust word recognition.

procedure, recognition, training procedure

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > Canada > Quebec > Montreal (0.14)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.95)

Horn, David, Usher, Marius

Oscillatory Model of Short Term Memory

It seems quite natural to assume that the limited capacity is due to the special dynamical nature of STM. Recently, Crick and Koch (1990) suggested that the working memory is functionally related to the binding process, and is obtained via synchronized oscillations of neural populations. The capacity limitation of STM may then result from the competition between oscillations representing items in STM. In the model which we investigate this is indeed the case.

oscillation, staggered oscillation, time scale

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
North America > United States > California > Los Angeles County > Pasadena (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.86)