The Effect of Correlated Input Data on the Dynamics of Learning

Søren Halkjær, Ole Winther

Neural Information Processing Systems 

The convergence properties of the gradient descent algorithm for the linear perceptron can be obtained from the response function. We derive a general expression for the response function and apply it to the case of data with simple input correlations. It is found that correlations may severely slow down learning. This explains the success of PCA as a method for reducing training time. Motivated by this finding, we furthermore propose to transform the input data by removing the mean across input variables as well as examples in order to decrease correlations. Numerical findings for a medical classification problem are in good agreement with the theoretical results.

1 INTRODUCTION

Learning and generalization are important areas of research within the field of neural networks. Although good generalization is the ultimate goal in feed-forward networks (perceptrons), it is of practical importance to understand the mechanisms which control the amount of time required for learning, i.e. the dynamics of learning. This is of course particularly important in the case of a large data set. An exact analysis of this mechanism is possible for the linear perceptron, and as usual it is hoped that the results may to some extent be carried over to explain the behaviour of nonlinear perceptrons.
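To make the two central claims concrete, the following minimal Python sketch (our own illustration, not the authors' code; the input bias m, the learning-rate choice, and all other parameter values are assumptions) trains a linear perceptron by batch gradient descent on inputs sharing a common mean, then repeats the experiment after removing the mean across input variables as well as examples ("double centering") as proposed above.

    # Minimal sketch: correlated inputs slow gradient descent on a
    # linear perceptron; double centering removes the correlations.
    # Illustrative only; parameter values are assumptions.
    import numpy as np

    rng = np.random.default_rng(0)
    P, N = 500, 20              # number of examples, input dimension

    # A common bias m on every input component creates a large
    # outlier eigenvalue in the input correlation matrix.
    m = 2.0
    X = m + rng.standard_normal((P, N))
    w_true = rng.standard_normal(N)

    def gd_epochs(X, y, tol=1e-6, max_epochs=10_000):
        """Batch gradient descent on E(w) = ||X w - y||^2 / (2P);
        returns epochs to convergence and the ratio of the largest to
        the smallest nonzero eigenvalue of the correlation matrix."""
        n_ex, n_in = X.shape
        C = X.T @ X / n_ex                   # input correlation matrix
        eigs = np.linalg.eigvalsh(C)         # ascending order
        nz = eigs[eigs > 1e-10 * eigs[-1]]   # drop null directions
        lr = 1.0 / eigs[-1]                  # stable step ~ 1/lambda_max
        w = np.zeros(n_in)
        for epoch in range(1, max_epochs + 1):
            grad = X.T @ (X @ w - y) / n_ex
            w -= lr * grad
            if np.linalg.norm(grad) < tol:
                break
        return epoch, nz[-1] / nz[0]

    # Double centering: subtract the mean over examples and over
    # input variables (adding back the grand mean once).
    Xc = X - X.mean(axis=0) - X.mean(axis=1, keepdims=True) + X.mean()

    for name, data in [("raw (correlated)", X), ("double-centered", Xc)]:
        t = data @ w_true          # targets for a comparable problem
        epochs, ratio = gd_epochs(data, t)
        print(f"{name:17s}: {epochs:6d} epochs, eigenvalue ratio {ratio:8.1f}")

The raw inputs should need far more epochs than the centered ones: the stable learning rate is limited by the large outlier eigenvalue that the common mean creates, so the remaining directions, whose eigenvalues are of order one, relax very slowly. Double centering removes the outlier, the eigenvalue ratio collapses, and gradient descent converges in a small number of epochs.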
