AITopics

This paper presents a new approach to speech recognition with hybrid HMM/ANN technology. While the standard approach to hybrid HMMI ANN systems is based on the use of neural networks as posterior probability estimators, the new approach is based on the use of mutual information neural networks trained with a special learning algorithm in order to maximize the mutual information between the input classes of the network and its resulting sequence of firing output neurons during training. It is shown in this paper that such a neural network is an optimal neural vector quantizer for a discrete hidden Markov model system trained on Maximum Likelihood principles. One of the main advantages of this approach is the fact, that such neural networks can be easily combined with HMM's of any complexity with context-dependent capabilities. It is shown that the resulting hybrid system achieves very high recognition rates, which are now already on the same level as the best conventional HMM systems with continuous parameters, and the capabilities of the mutual information neural networks are not yet entirely exploited.

acoustic processor, neural network, probability, (11 more...)

Country: Europe > Germany (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.90)

Platt, John C., Matic, Nada

A Constructive RBF Network for Writer Adaptation

This paper discusses a fairly general adaptation algorithm which augments a standard neural network to increase its recognition accuracy for a specific user. The basis for the algorithm is that the output of a neural network is characteristic of the input, even when the output is incorrect. We exploit this characteristic output by using an Output Adaptation Module (OAM) which maps this output into the correct user-dependent confidence vector. The OAM is a simplified Resource Allocating Network which constructs radial basis functions online. We applied the OAM to construct a writer-adaptive character recognition system for online handprinted characters.

neural network, oam, recognizer, (14 more...)

Country:

North America > United States > California > Santa Clara County > San Jose (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.68)

Lee, Te-Won, Bell, Anthony J., Lambert, Russell H.

Blind Separation of Delayed and Convolved Sources

We address the difficult problem of separating multiple speakers with multiple microphones in a real room. We combine the work of Torkkola and Amari, Cichocki and Yang, to give Natural Gradient information maximisation rules for recurrent (IIR) networks, blindly adjusting delays, separating and deconvolving mixed signals. While they work well on simulated data, these rules fail in real rooms which usually involve non-minimum phase transfer functions, not-invertible using stable IIR filters. An approach that sidesteps this problem is to perform infomax on a feedforward architecture in the frequency domain (Lambert 1996). We demonstrate real-room separation of two natural signals using this approach.

architecture, blind separation, separation, (14 more...)

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California > San Diego County > La Jolla (0.04)
Europe > Germany (0.04)
Asia > Japan > Shikoku > Kōchi Prefecture > Kochi (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Gray, Michael S., Movellan, Javier R., Sejnowski, Terrence J.

Dynamic Features for Visual Speechreading: A Systematic Comparison

Humans use visual as well as auditory speech signals to recognize spoken words. A variety of systems have been investigated for performing this task. The main purpose of this research was to systematically compare the performance of a range of dynamic visual features on a speechreading task. We have found that normalization of images to eliminate variation due to translation, scale, and planar rotation yielded substantial improvements in generalization performance regardless of the visual representation used. In addition, the dynamic information in the difference between successive frames yielded better performance than optical-flow based approaches, and compression by local low-pass filtering worked surprisingly better than global principal components analysis (PCA). These results are examined and possible explanations are explored.

information, representation, visual information, (12 more...)

Country:

North America > United States > California > San Diego County > San Diego (0.07)
North America > United States > California > San Diego County > La Jolla (0.05)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.33)

Schaik, André van, Fragnière, Eric, Vittoz, Eric A.

A Silicon Model of Amplitude Modulation Detection in the Auditory Brainstem

Detectim of the periodicity of amplitude modulatim is a major step in the determinatim of the pitch of a SOODd. In this article we will present a silicm model that uses synchrroicity of spiking neurms to extract the fundamental frequency of a SOODd. It is based m the observatim that the so called'Choppers' in the mammalian Cochlear Nucleus synchrmize well for certain rates of amplitude modulatim, depending m the cell's intrinsic chopping frequency. Our silicm model uses three different circuits, i.e., an artificial cochlea, an Inner Hair Cell circuit, and a spiking neuron circuit

chopper, frequency, neuron, (14 more...)

Country:

North America > United States > New York (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)

Technology: Information Technology > Artificial Intelligence (0.69)

Pineda, Fernando J., Cauwenberghs, Gert, Edwards, R. Timothy

Bangs, Clicks, Snaps, Thuds and Whacks: An Architecture for Acoustic Transient Processing

We report progress towards our long-term goal of developing low-cost, low-power, lowcomplexity analog-VLSI processors for real-time applications. We propose a neuromorphic architecture for acoustic processing in analog VLSI. The characteristics of the architecture are explored by using simulations and real-world acoustic transients. We use acoustic transients in our experiments because information in the form of acoustic transients pervades the natural world. Insects, birds, and mammals (especially marine mammals) all employ acoustic signals with rich transient structure.

algorithm, representation, template, (14 more...)

Country:

North America > United States > Maryland > Baltimore (0.14)
North America > United States > Michigan > Wayne County > Detroit (0.04)
North America > United States > Maryland > Prince George's County > Laurel (0.04)
North America > United States > District of Columbia > Washington (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Acoustic Processing (0.35)

Lazzaro, John, Wawrzynek, John, Lippmann, Richard P.

A Micropower Analog VLSI HMM State Decoder for Wordspotting

We describe the implementation of a hidden Markov model state decoding system, a component for a wordspotting speech recognition system. The key specification for this state decoder design is microwatt power dissipation; this requirement led to a continuoustime, analog circuit implementation. We characterize the operation of a 10-word (81 state) state decoder test chip.

analog vlsi hmm state decoder, likelihood, log likelihood, (11 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Industry:

Government > Military (0.69)
Government > Regional Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.90)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.56)

Iizuka, Kunihiko, Miyamoto, Masayuki, Matsui, Hirofumi

Dynamically Adaptable CMOS Winner-Take-All Neural Network

The major problem that has prevented practical application of analog neuro-LSIs has been poor accuracy due to fluctuating analog device characteristics inherent in each device as a result of manufacturing. This paper proposes a dynamic control architecture that allows analog silicon neural networks to compensate for the fluctuating device characteristics and adapt to a change in input DC level. We have applied this architecture to compensate for input offset voltages of an analog CMOS WTA (Winner-Take-AlI) chip that we have fabricated. Experimental data show the effectiveness of the architecture.

architecture, node, voltage, (11 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.04)

Industry: Semiconductors & Electronics (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.65)

Horiuchi, Timothy K., Morris, Tonia G., Koch, Christof, DeWeerth, Stephen P.

Analog VLSI Circuits for Attention-Based, Visual Tracking

A one-dimensional visual tracking chip has been implemented using neuromorphic, analog VLSI techniques to model selective visual attention in the control of saccadic and smooth pursuit eye movements. The chip incorporates focal-plane processing to compute image saliency and a winner-take-all circuit to select a feature for tracking. The target position and direction of motion are reported as the target moves across the array. We demonstrate its functionality in a closed-loop system which performs saccadic and smooth pursuit tracking movements using a one-dimensional mechanical eye.

analog vlsi circuit, attention-based, saliency map, (11 more...)

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
North America > United States > California > Los Angeles County > Pasadena (0.05)
Europe > Switzerland > Vaud > Lausanne (0.04)

Industry: Semiconductors & Electronics (0.66)

Technology: Information Technology > Artificial Intelligence (0.47)

Häfliger, Philipp, Mahowald, Misha, Watts, Lloyd

A Spike Based Learning Neuron in Analog VLSI

Many popular learning rules are formulated in terms of continuous, analog inputs and outputs. Biological systems, however, use action potentials, which are digital-amplitude events that encode analog information in the inter-event interval. Action-potential representations are now being used to advantage in neuromorphic VLSI systems as well. We report on a simple learning rule, based on the Riccati equation described by Kohonen [1], modified for action-potential neuronal outputs. We demonstrate this learning rule in an analog VLSI chip that uses volatile capacitive storage for synaptic weights. We show that our time-dependent learning rule is sufficient to achieve approximate weight normalization and can detect temporal correlations in spike trains.

neuron, spike, synapse, (13 more...)

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
North America > United States > California > Santa Clara County > Santa Clara (0.04)

Industry: Semiconductors & Electronics (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)