AITopics

We consider the problem of online gradient descent learning for general two-layer neural networks. An analytic solution is presented and used to investigate the role of the learning rate in controlling the evolution and convergence of the learning process. Two-layer networks with an arbitrary number of hidden units have been shown to be universal approximators [1] for such N-to-one dimensional maps. We investigate the emergence of generalization ability in an online learning scenario [2], in which the couplings are modified after the presentation of each example so as to minimize the corresponding error. The resulting changes in {J} are described as a dynamical evolution; the number of examples plays the role of time.

equation, generalization error, vector, (13 more...)

Country:

North America > United States (0.04)
Europe > United Kingdom (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.62)

Sollich, Peter, Krogh, Anders

Learning with ensembles: How overfitting can be useful

We study the characteristics of learning with ensembles. Solving exactly the simple model of an ensemble of linear students, we find surprisingly rich behaviour. For learning in large ensembles, it is advantageous to use under-regularized students, which actually over-fit the training data. Globally optimal performance can be obtained by choosing the training set sizes of the students appropriately. For smaller ensembles, optimization of the ensemble weights can yield significant improvements in ensemble generalization performance, in particular if the individual students are subject to noise in the training process. Choosing students with a wide range of regularization parameters makes this improvement robust against changes in the unknown level of noise in the training data. 1 INTRODUCTION An ensemble is a collection of a (finite) number of neural networks or other types of predictors that are trained for the same task.

ensemble, generalization error, student, (17 more...)

Country: Europe > Denmark > Capital Region > Copenhagen (0.04)

Industry: Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.36)

Is Learning The n-th Thing Any Easier Than Learning The First?

Thrun, Sebastian

This paper investigates learning in a lifelong context. Lifelong learning addresses situations in which a learner faces a whole stream of learning tasks.Such scenarios provide the opportunity to transfer knowledge across multiple learning tasks, in order to generalize more accurately from less training data. In this paper, several different approaches to lifelong learning are described, and applied in an object recognition domain. It is shown that across the board, lifelong learning approaches generalize consistently more accurately from less training data, by their ability to transfer knowledge across learning tasks. 1 Introduction Supervised learning is concerned with approximating an unknown function based on examples. Virtuallyall current approaches to supervised learning assume that one is given a set of input-output examples, denoted by X, which characterize an unknown function, denoted by f.

artificial intelligence, knowledge, machine learning, (17 more...)

Country: North America > United States > California (0.15)

Genre:

Overview (0.74)
Research Report (0.54)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Schraudolph, Nicol N., Sejnowski, Terrence J.

Tempering Backpropagation Networks: Not All Weights are Created Equal

Backpropagation learning algorithms typically collapse the network's structure into a single vector of weight parameters to be optimized. We suggest that their performance may be improved by utilizing the structural informationinstead of discarding it, and introduce a framework for ''tempering'' each weight accordingly. In the tempering model, activation and error signals are treated as approximately independentrandom variables. The characteristic scale of weight changes is then matched to that ofthe residuals, allowing structural properties suchas a node's fan-in and fan-out to affect the local learning rate and backpropagated error. The model also permits calculation of an upper bound on the global learning rate for batch updates, which in turn leads to different update rules for bias vs. non-bias weights. This approach yields hitherto unparalleled performance on the family relations benchmark,a deep multi-layer network: for both batch learning with momentum and the delta-bar-delta algorithm, convergence at the optimal learning rate is sped up by more than an order of magnitude.

artificial intelligence, learning rate, machine learning, (16 more...)

Country: North America > United States (0.69)

Industry: Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Backpropagation (0.65)

West, Ansgar H. L., Saad, David

Adaptive Back-Propagation in On-Line Learning of Multilayer Networks

This research has been motivated by the dominance of the suboptimal symmetric phase in online learning of two-layer feedforward networks trained by gradient descent [2]. This trapping is emphasized for inappropriate small learning rates but exists in all training scenarios, effecting the learning process considerably. We Adaptive Back-Propagation in Online Learning of Multilayer Networks 329 proposed an adaptive back-propagation training algorithm [Eq.

artificial intelligence, gradient descent, machine learning, (12 more...)

Genre: Instructional Material > Online (0.40)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Saad, David, Solla, Sara A.

Dynamics of On-Line Gradient Descent Learning for Multilayer Neural Networks

Sollat CONNECT, The Niels Bohr Institute Blegdamsdvej 17 Copenhagen 2100, Denmark Abstract We consider the problem of online gradient descent learning for general two-layer neural networks. An analytic solution is presented andused to investigate the role of the learning rate in controlling theevolution and convergence of the learning process. Two-layer networks with an arbitrary number of hidden units have been shown to be universal approximators [1] for such N-to-one dimensional maps. We investigate the emergence of generalization ability in an online learning scenario [2], in which the couplings are modified after the presentation of each example so as to minimize the corresponding error. The resulting changes in {J} are described as a dynamical evolution; the number of examples plays the role of time.

artificial intelligence, generalization error, machine learning, (15 more...)

Country: Europe > Denmark > Capital Region > Copenhagen (0.24)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.62)

Sollich, Peter, Krogh, Anders

Learning with ensembles: How overfitting can be useful

AndersKrogh'" NORDITA, Blegdamsvej 17 2100 Copenhagen, Denmark kroghGsanger.ac.uk Abstract We study the characteristics of learning with ensembles. Solving exactly the simple model of an ensemble of linear students, we find surprisingly rich behaviour. For learning in large ensembles, it is advantageous to use under-regularized students, which actually over-fitthe training data. Globally optimal performance can be obtained by choosing the training set sizes of the students appropriately. Forsmaller ensembles, optimization of the ensemble weights can yield significant improvements in ensemble generalization performance,in particular if the individual students are subject to noise in the training process. Choosing students with a wide range of regularization parameters makes this improvement robust against changes in the unknown level of noise in the training data. 1 INTRODUCTION An ensemble is a collection of a (finite) number of neural networks or other types of predictors that are trained for the same task.

artificial intelligence, machine learning, student, (18 more...)

Country: Europe > Denmark > Capital Region > Copenhagen (0.24)

Industry: Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.36)

AAAI News

Hamilton, Carol

AI MagazineDec-15-1996

Topics An all-star panel assembled to pose life, and a wealth of other topics were of the technical papers spanned a "Challenge Problems for Artificial Intelligence."

application, space agency, us government, (25 more...)

AI Magazine

Country:

North America > United States > California (0.28)
North America > United States > Oregon (0.14)
Oceania > Australia (0.14)
(5 more...)

Genre:

Instructional Material (0.93)
Press Release (0.68)
Personal > Honors (0.46)

Industry:

Telecommunications (1.00)
Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Cheng, Chun Hung, Holsapple, Clyde W., Lee, Anita

Citation-Based Journal Rankings for AI Research A Business Perspective

AI MagazineJun-15-1996

A significant and growing area of business-computing research is concerned with AI. Knowledge about which journals are the most influential forums for disseminating AI research is important for business school faculty, students, administrators, and librarians. To date, there has been only one study attempting to rank AI journals from a business-computing perspective. It used a subjective methodology, surveying opinions of business faculty about a prespecified list of 30 journals. Here, we report the results of a more objective study. We conducted a citation analysis covering a time period of 5 years to compile 15,600 citations to 1,244 different journals. Based on these data, the journals are ranked in two ways involving the magnitude and the duration of scientific impact each has had in the field of AI.

artificial intelligence, expert system, ieee transaction, (12 more...)

AI Magazine

Country: North America > United States (0.47)

Genre:

Research Report (0.68)
Overview (0.46)

Industry:

Media > Publishing (0.72)
Education (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.75)

Hinkle, David, Kortenkamp, David, Miller, David

The 1995 Robot Competition and Exhibition

AI MagazineMar-15-1996

The 1995 Robot Competition and Exhibition was held in Montreal, Canada, in conjunction with the 1995 International Joint Conference on Artificial Intelligence. The competition was designed to demonstrate state-of-the-art autonomous mobile robots, highlighting such tasks as goal-directed navigation, feature detection, object recognition, identification, and physical manipulation as well as effective human-robot communication. The competition consisted of two separate events: (1) Office Delivery and (2) Office Cleanup. The exhibition also consisted of two events: (1) demonstrations of robotics research that was not related to the contest and (2) robotics focused on aiding people who are mobility impaired. There was also a Robotics Forum for technical exchange of information between robotics researchers. Thus, this year's events covered the gamut of robotics research, from discussions of control strategies to demonstrations of useful prototype application systems.

artificial intelligence, competition, robot, (16 more...)

AI Magazine

Country:

North America > United States > California (0.28)
North America > Canada > Quebec > Montreal (0.24)

Industry:

Education (1.00)
Government > Space Agency (0.46)
Government > Regional Government > North America Government > United States Government (0.46)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)