AITopics

G/SPLINES is an algorithm for building functional models of data. It uses genetic search to discover combinations of basis functions which are then used to build a least-squares regression model. Because it produces a population of models which evolve over time rather than a single model, it allows analysis not possible with other regression-based approaches. 1 INTRODUCTION G/SPLINES is a hybrid of Friedman's Multivariable Adaptive Regression Splines (MARS) algorithm (Friedman, 1990) with Holland's Genetic Algorithm (Holland, 1975). G/SPLINES has advantages over MARS in that it requires fewer least-squares computations, is easily extendable to non-spline basis functions, may discover models inaccessible to local-variable selection algorithms, and allows significantly larger problems to be considered. These issues are discussed in (Rogers, 1991). This paper begins with a discussion of linear regression models, followed by a description of the G/SPLINES algorithm, and finishes with a series of experiments illustrating its performance, robustness, and analysis capabilities.

artificial intelligence, basis function, machine learning, (17 more...)

Country:

North America > United States > Michigan (0.14)
North America > United States > California (0.14)

Industry: Government (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Node Splitting: A Constructive Algorithm for Feed-Forward Neural Networks

Wynne-Jones, Mike

The small network forms an approximate model of a set of training data, and the split creates a larger more powerful network which is initialised with the approximate solution already found. The insufficiency of the smaller network in modelling the system which generated the data leads to oscillation in those hidden nodes whose weight vectors cover regions inthe input space where more detail is required in the model. These nodes are identified and split in two using principal component analysis, allowing the new nodes t.o cover the two main modes of each oscillating vector. Nodes are selected for splitting using principal component analysis on the oscillating weight vectors, or by examining the Hessian matrix of second derivatives of the network error with respect to the weight.s.

artificial intelligence, machine learning, node, (13 more...)

Country: North America > United States > California (0.29)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.45)

Best-First Model Merging for Dynamic Learning and Recognition

Omohundro, Stephen M.

Stephen M. Omohundro International Computer Science Institute 1947 CenteJ' Street, Suite 600 Berkeley, California 94704 Abstract "Best-first model merging" is a general technique for dynamically choosing the structure of a neural or related architecture while avoiding overfitting.It is applicable to both leaming and recognition tasks and often generalizes significantly better than fixed structures. We demonstrate theapproach applied to the tasks of choosing radial basis functions for function learning, choosing local affine models for curve and constraint surface modelling, and choosing the structure of a balltree or bumptree to maximize efficiency of access. 1 TOWARD MORE COGNITIVE LEARNING Standard backpropagation neural networks learn in a way which appears to be quite different fromhuman leaming. Viewed as a cognitive system, a standard network always maintains acomplete model of its domain. This model is mostly wrong initially, but gets gradually better and better as data appears. The net deals with all data in much the same way and has no representation for the strength of evidence behind a certain conclusion. The network architecture is usually chosen before any data is seen and the processing is much the same in the early phases of learning as in the late phases.

artificial intelligence, best-first model merging, machine learning, (16 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.24)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems

Moody, John E.

We present an analysis of how the generalization performance (expected test set error) relates to the expected training set error for nonlinear learning systems,such as multilayer perceptrons and radial basis functions.

artificial intelligence, effective number, machine learning, (15 more...)

Country: North America > United States (0.29)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)

Principles of Risk Minimization for Learning Theory

Vapnik, V.

Learning is posed as a problem of function estimation, for which two principles ofsolution are considered: empirical risk minimization and structural risk minimization. These two principles are applied to two different statements ofthe function estimation problem: global and local. Systematic improvements in prediction power are illustrated in application to zip-code recognition.

artificial intelligence, machine learning, minimization, (15 more...)

Country: North America > United States (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Venturini, Rita, Lytton, William W., Sejnowski, Terrence J.

Neural Network Analysis of Event Related Potentials and Electroencephalogram Predicts Vigilance

Automated monitoring of vigilance in attention intensive tasks such as air traffic control or sonar operation is highly desirable. As the operator monitorsthe instrument, the instrument would monitor the operator, insuring against lapses. We have taken a first step toward this goal by using feedforwardneural networks trained with backpropagation to interpret event related potentials (ERPs) and electroencephalogram (EEG) associated withperiods of high and low vigilance. The accuracy of our system on an ERP data set averaged over 28 minutes was 96%, better than the 83% accuracy obtained using linear discriminant analysis. Practical vigilance monitoring will require prediction over shorter time periods. We were able to average the ERP over as little as 2 minutes and still get 90% correct prediction of a vigilance measure. Additionally, we achieved similarly good performance using segments of EEG power spectrum as short as 56 sec.

artificial intelligence, erp, machine learning, (12 more...)

Country: North America > United States (0.48)

Industry:

Transportation > Infrastructure & Services (0.54)
Transportation > Air (0.54)
Health & Medicine > Therapeutic Area (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Structural Risk Minimization for Character Recognition

Guyon, I., Vapnik, V., Boser, B., Bottou, L., Solla, S. A.

The method of Structural Risk Minimization refers to tuning the capacity of the classifier to the available amount of training data.

artificial intelligence, classifier, machine learning, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Frawley, William J., Piatetsky-Shapiro, Gregory, Matheus, Christopher J.

Knowledge Discovery in Databases: An Overview

AI MagazineSep-15-1992

After a decade of fundamental interdisciplinary research in machine learning, the spadework in this field has been done; the 1990s should see the widespread exploitation of knowledge discovery as an aid to assembling knowledge bases. The contributors to the AAAI Press book Knowledge Discovery in Databases were excited at the potential benefits of this research. The editors hope that some of this excitement will communicate itself to "AI Magazine readers of this article.

database, discovery, knowledge, (11 more...)

AI Magazine

Country:

North America > Haiti (0.14)
North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > New York (0.04)
(8 more...)

Genre: Overview (0.68)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Banking & Finance (0.93)
Government (0.93)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science > Data Mining > Knowledge Discovery (0.82)

AI Review

Graube, Nicholas, Males, Kevin, Schwartz, Tom, Mandelker, Dave, AAAI,

AI MagazineJun-15-1992

An effective to develop an abstract UI model which solution to this problem must have three encompasses the essence of those systems.

knowledge management, machine learning, real time system, (24 more...)

AI Magazine

Country:

Europe (1.00)
North America > United States > Massachusetts > Middlesex County (0.67)
North America > United States > California > Santa Clara County (0.46)
(2 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Media (1.00)
Information Technology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(7 more...)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Software Engineering (1.00)
Information Technology > Sensing and Signal Processing (1.00)
(19 more...)

A training algorithm for optimal margin classifiers

Boser, B. | Guyon, I. | Vapnik, V. N.

ClassicsFeb-1-1992

A training algorithm that maximizes the margin between the training patterns and the decision boundary is presented. The technique is applicable to a wide variety of the classification functions, including Perceptrons, polynomials, and Radial Basis Functions. The effective number of parameters is adjusted automatically to match the complexity of the problem. The solution is expressed as a linear combination of supporting patterns. These are the subset of training patterns that are closest to the decision boundary.

artificial intelligence, machine learning, training algorithm, (3 more...)

Classics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)