Collaborating Authors

 Solla, Sara A.


Learning Curves: Asymptotic Values and Rate of Convergence

Neural Information Processing Systems

Training classifiers on large databases is computationally demanding. It is desirable to develop efficient procedures for a reliable prediction of a classifier's suitability for implementing a given task, so that resources can be assigned to the most promising candidates or freed for exploring new classifier candidates. We propose such a practical and principled predictive method: practical because it avoids the costly procedure of training poor classifiers on the whole training set, and principled because of its theoretical foundation. The effectiveness of the proposed procedure is demonstrated for both single- and multi-layer networks.
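A rough sense of the method's flavor: one can measure test error at a few small training-set sizes, fit a learning-curve model, and extrapolate the asymptotic error and convergence rate before committing to full training. The sketch below is a minimal illustration of that idea, assuming a power-law form E(n) ≈ E_inf + b/n^α; the data points, the model form, and the fitting routine are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np
from scipy.optimize import curve_fit

# Hypothetical test-error measurements at a few small training-set sizes.
n = np.array([100, 200, 400, 800, 1600])
err = np.array([0.31, 0.24, 0.19, 0.155, 0.13])

def learning_curve(n, e_inf, b, alpha):
    # Power-law ansatz: error decays toward an asymptotic value e_inf
    # at a rate governed by the exponent alpha.
    return e_inf + b / n ** alpha

params, _ = curve_fit(learning_curve, n, err, p0=[0.1, 10.0, 1.0], maxfev=10000)
e_inf, b, alpha = params
print(f"asymptotic error ~ {e_inf:.3f}, rate exponent alpha ~ {alpha:.2f}")

# Extrapolate to the full database size to decide whether this classifier
# is worth training to completion.
print(f"predicted error at n=100000: {learning_curve(100000, *params):.3f}")
```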


Second Order Properties of Error Surfaces: Learning Time and Generalization

Neural Information Processing Systems

The learning time of a simple neural network model is obtained through an analytic computation of the eigenvalue spectrum for the Hessian matrix, which describes the second order properties of the cost function in the space of coupling coefficients. The form of the eigenvalue distribution suggests new techniques for accelerating the learning process, and provides a theoretical justification for the choice of centered versus biased state variables.
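The practical upshot is that the spread of Hessian eigenvalues controls gradient-descent learning time, and centering the state variables shrinks that spread. Below is a minimal sketch, assuming a linear unit with quadratic cost, for which the Hessian in weight space is simply the input correlation matrix; the toy data are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy inputs: binary 0/1 ("biased") versus +/-1 ("centered") state variables.
x_biased = rng.integers(0, 2, size=(5000, 20)).astype(float)
x_centered = 2.0 * x_biased - 1.0

def hessian_spectrum(x):
    # For a linear unit with quadratic cost, the Hessian of the cost in
    # the space of coupling coefficients is the input correlation matrix.
    h = x.T @ x / len(x)
    return np.linalg.eigvalsh(h)  # eigenvalues in ascending order

for name, x in [("biased 0/1", x_biased), ("centered +/-1", x_centered)]:
    ev = hessian_spectrum(x)
    # The largest stable gradient-descent step is ~ 2/lambda_max, while the
    # slowest mode relaxes at a rate ~ lambda_min, so learning time scales
    # with the condition number lambda_max / lambda_min.
    print(f"{name}: condition number ~ {ev[-1] / ev[0]:.1f}")
```

Running this shows the biased 0/1 representation producing a far larger eigenvalue spread than the centered one, which is the intuition behind preferring centered state variables.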


Neural Network Implementation of Admission Control

Neural Information Processing Systems

A feedforward layered network implements a mapping required to control an unknown stochastic nonlinear dynamical system. Training is based on a novel approach that combines stochastic approximation ideas with backpropagation. The method is applied to control admission into a queueing system operating in a time-varying environment.
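A toy sketch of the setup, not the paper's algorithm: the paper combines stochastic approximation ideas with backpropagation, whereas the illustration below trains a small feedforward admission controller on a made-up single-server queue using simultaneous-perturbation stochastic approximation (SPSA), a simpler gradient-free relative. The network shape, reward terms, and step sizes are all assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

def controller(w, state):
    # Tiny one-hidden-layer network mapping queue state to an admit probability.
    h = np.tanh(w["W1"] @ state + w["b1"])
    return 1.0 / (1.0 + np.exp(-(w["W2"] @ h + w["b2"])))

def episode_reward(w, steps=200):
    # Toy single-server queue: reward admitted jobs, penalize congestion.
    q, reward = 0, 0.0
    for _ in range(steps):
        state = np.array([q / 10.0])
        if rng.random() < controller(w, state):  # admit the arriving job?
            q += 1
            reward += 1.0
        q = max(0, q - 1)   # one service completion per step
        reward -= 0.1 * q   # holding (congestion) cost
    return reward

w = {"W1": 0.1 * rng.standard_normal((4, 1)), "b1": np.zeros(4),
     "W2": 0.1 * rng.standard_normal(4), "b2": 0.0}

# Stochastic-approximation loop: the system's reward gradient is unknown,
# so estimate it from two perturbed rollouts per step (SPSA).
for t in range(1, 201):
    a, c = 0.5 / t, 0.1
    delta = {k: rng.choice([-1.0, 1.0], size=np.shape(v)) for k, v in w.items()}
    wp = {k: w[k] + c * delta[k] for k in w}
    wm = {k: w[k] - c * delta[k] for k in w}
    g = (episode_reward(wp) - episode_reward(wm)) / (2 * c)
    w = {k: w[k] + a * g * delta[k] for k in w}  # ascend the reward estimate
```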


Optimal Brain Damage

Neural Information Processing Systems

We have used information-theoretic ideas to derive a class of practical and nearly optimal schemes for adapting the size of a neural network. By removing unimportant weights from a network, several improvements can be expected: better generalization, fewer training examples required, and improved speed of learning and/or classification. The basic idea is to use second-derivative information to make a tradeoff between network complexity and training set error. Experiments confirm the usefulness of the methods on a real-world application.

1 INTRODUCTION

Most successful applications of neural network learning to real-world problems have been achieved using highly structured networks of rather large size [for example (Waibel, 1989; Le Cun et al., 1990a)]. As applications become more complex, the networks will presumably become even larger and more structured.
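The pruning step is easy to make concrete. Under OBD's diagonal-Hessian approximation, the saliency of weight w_k is s_k = H_kk * w_k^2 / 2, the estimated increase in training error from deleting that weight. The sketch below applies this rule to made-up numbers; in practice the diagonal Hessian terms are computed by a backpropagation-style pass, which is not shown here.

```python
import numpy as np

def obd_prune(weights, hessian_diag, frac=0.2):
    """Optimal Brain Damage pruning: delete lowest-saliency weights.

    Saliency of weight w_k is s_k = H_kk * w_k^2 / 2, the second-order
    estimate of the training-error increase from setting w_k to zero,
    valid at a minimum of the cost under a diagonal-Hessian approximation.
    """
    saliency = 0.5 * hessian_diag * weights ** 2
    cutoff = np.quantile(saliency, frac)
    mask = saliency > cutoff  # keep only high-saliency weights
    return weights * mask, mask

# Illustrative numbers; real H_kk values come from a backprop-style pass.
w = np.array([0.8, -0.05, 1.2, 0.01, -0.4])
h = np.array([2.0, 1.5, 0.9, 3.0, 1.1])
pruned, kept = obd_prune(w, h, frac=0.4)
print(pruned)  # the lowest-saliency 40% of weights are set to zero
```

Note that saliency depends on the curvature H_kk as well as the magnitude w_k^2, which is why OBD can outperform naive magnitude-based pruning.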

