AITopics

It has been observed in numerical simulations that a weight decay can improve generalizationin a feed-forward neural network.

artificial intelligence, neural network, weight decay, (14 more...)

Country:

Europe (0.29)
North America > United States > California > Santa Cruz County > Santa Cruz (0.14)

Genre: Research Report (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Röscheisen, Martin, Hofmann, Reimar, Tresp, Volker

Neural Control for Rolling Mills: Incorporating Domain Theories to Overcome Data Deficiency

In a Bayesian framework, we give a principled account of how domainspecific priorknowledge such as imperfect analytic domain theories can be optimally incorporated into networks of locally-tuned units: by choosing a specific architecture and by applying a specific training regimen. Our method proved successful in overcoming the data deficiency problem in a large-scale application to devise a neural control for a hot line rolling mill. It achieves in this application significantly higher accuracy than optimally-tuned standard algorithms such as sigmoidal backpropagation, and outperforms the state-of-the-art solution.

artificial intelligence, bayesian inference, neural network, (17 more...)

Genre: Research Report > Promising Solution (0.35)

Industry: Materials > Metals & Mining (0.74)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Benchmarking Feed-Forward Neural Networks: Models and Measures

Hamey, Leonard G. C.

Existing metrics for the learning performance of feed-forward neural networks do not provide a satisfactory basis for comparison because the choice of the training epoch limit can determine the results of the comparison. I propose new metrics which have the desirable property of being independent of the training epoch limit. The efficiency measures the yield of correct networks in proportion to the training effort expended. The optimal epoch limit provides the greatest efficiency. The learning performance is modelled statistically, and asymptotic performance is estimated. Implementation details may be found in (Harney, 1992). 1 Introduction The empirical comparison of neural network training algorithms is of great value in the development of improved techniques and in algorithm selection for problem solving. In view of the great sensitivity of learning times to the random starting weights (Kolen and Pollack, 1990), individual trial times such as reported in (Rumelhart, et al., 1986) are almost useless as measures of learning performance.

artificial intelligence, epoch limit, neural network, (14 more...)

Country: North America > United States > Massachusetts (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Mozer, Michael C., Zemel, Richard S., Behrmann, Marlene

Learning to Segment Images Using Dynamic Feature Binding

Despite the fact that complex visual scenes contain multiple, overlapping objects, people perform object recognition with ease and accuracy. One operation that facilitates recognition is an early segmentation process in which features of objects are grouped and labeled according to which object theybelong. Current computational systems that perform this operation arebased on predefined grouping heuristics.

artificial intelligence, feature unit, neural network, (18 more...)

Country:

North America > Canada > Ontario > Toronto (0.15)
North America > United States > Colorado > Boulder County > Boulder (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Cognitive Science (0.68)

The Efficient Learning of Multiple Task Sequences

Singh, Satinder P.

I present a modular network architecture and a learning algorithm based on incremental dynamic programming that allows a single learning agent to learn to solve multiple Markovian decision tasks (MDTs) with significant transferof learning across the tasks. I consider a class of MDTs, called composite tasks, formed by temporally concatenating a number of simpler, elemental MDTs. The architecture is trained on a set of composite andelemental MDTs. The temporal structure of a composite task is assumed to be unknown and the architecture learns to produce a temporal decomposition.It is shown that under certain conditions the solution of a composite MDT can be constructed by computationally inexpensive modifications of the solutions of its constituent elemental MDTs. 1 INTRODUCTION Most applications of domain independent learning algorithms have focussed on learning single tasks. Building more sophisticated learning agents that operate in complex environments will require handling multiple tasks/goals (Singh, 1992). Research efforton the scaling problem has concentrated on discovering faster learning algorithms, and while that will certainly help, techniques that allow transfer of learning across tasks will be indispensable for building autonomous learning agents that have to learn to solve multiple tasks. In this paper I consider a learning agent that interacts with an external, finite-state, discrete-time, stochastic dynamical environment andfaces multiple sequences of Markovian decision tasks (MDTs).

artificial intelligence, composite task, neural network, (20 more...)

Country: North America > United States > Massachusetts > Hampshire County > Amherst (0.29)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Network generalization for production: Learning and producing styled letterforms

Grebert, Igor, Stork, David G., Keesing, Ron, Mims, Steve

Here during the production event a very low infonnational input ("Madonna," and "Matisse") is used

artificial intelligence, font, neural network, (19 more...)

Country:

North America > United States > California > San Mateo County (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Intrator, Nathan, Gold, Joshua I., Bülthoff, Heinrich H., Edelman, Shimon

3D Object Recognition Using Unsupervised Feature Extraction

Gold Center for Neural Science, Brown University Providence, RI 02912, USA Shimon Edelman Dept. of Applied Mathematics and Computer Science, Weizmann Institute of Science, Rehovot 76100, Israel Abstract Intrator (1990) proposed a feature extraction method that is related to recent statistical theory (Huber, 1985; Friedman, 1987), and is based on a biologically motivated model of neuronal plasticity (Bienenstock et al., 1982). This method has been recently applied to feature extraction in the context of recognizing 3D objects from single 2D views (Intrator and Gold, 1991). Here we describe experiments designed to analyze the nature of the extracted features, and their relevance to the theory and psychophysics of object recognition. 1 Introduction Results of recent computational studies of visual recognition (e.g., Poggio and Edelman, 1990)indicate that the problem of recognition of 3D objects can be effectively reformulated in terms of standard pattern classification theory. According to this approach, an object is represented by a few of its 2D views, encoded as clusters in multidimentional space. Recognition of a novel view is then carried out by interpo-460 3D Object Recognition Using Unsupervised Feature Extraction 461 lating among the stored views in the representation space.

artificial intelligence, data mining, experiment, (15 more...)

Country:

North America > United States > Rhode Island > Providence County > Providence (0.25)
Asia > Middle East > Israel (0.24)

Industry: Government (0.47)

Technology:

Information Technology > Data Science > Data Mining > Feature Extraction (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Venturini, Rita, Lytton, William W., Sejnowski, Terrence J.

Neural Network Analysis of Event Related Potentials and Electroencephalogram Predicts Vigilance

Automated monitoring of vigilance in attention intensive tasks such as air traffic control or sonar operation is highly desirable. As the operator monitorsthe instrument, the instrument would monitor the operator, insuring against lapses. We have taken a first step toward this goal by using feedforwardneural networks trained with backpropagation to interpret event related potentials (ERPs) and electroencephalogram (EEG) associated withperiods of high and low vigilance. The accuracy of our system on an ERP data set averaged over 28 minutes was 96%, better than the 83% accuracy obtained using linear discriminant analysis. Practical vigilance monitoring will require prediction over shorter time periods. We were able to average the ERP over as little as 2 minutes and still get 90% correct prediction of a vigilance measure. Additionally, we achieved similarly good performance using segments of EEG power spectrum as short as 56 sec.

air transportation, erp, neural network, (15 more...)

Country: North America > United States (0.48)

Industry:

Transportation > Infrastructure & Services (0.54)
Transportation > Air (0.54)
Health & Medicine > Therapeutic Area (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Sutton, Jeffrey P., Mamelak, Adam N., Hobson, J. Allan

Network Model of State-Dependent Sequencing

A network model with temporal sequencing and state-dependent modulatory featuresis described. The model is motivated by neurocognitive data characterizing different states of waking and sleeping. Computer studies demonstrate how unique states of sequencing can exist within the same network under different aminergic and cholinergic modulatory influences. Relationships between state-dependent modulation, memory, sequencing and learning are discussed.

artificial intelligence, health & medicine, sequence, (17 more...)

Country:

North America > United States > Massachusetts (0.28)
North America > United States > California > San Francisco County > San Francisco (0.28)

Industry: Health & Medicine > Therapeutic Area > Sleep (0.49)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Networks (0.86)

Ji, Chuanyi, Psaltis, Demetri

The VC-Dimension versus the Statistical Capacity of Multilayer Networks

The former characterizes their "Present Address: Department of Electrical Computer and System Engineering, Rensselaer Polytech Institute, Troy, NY 12180.

artificial intelligence, neural network, vc-dimension, (16 more...)

Country: North America > United States > New York > Rensselaer County > Troy (0.24)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.50)