Use of Bad Training Data for Better Predictions
Neural Information Processing Systems
We show how randomly scrambling the output classes of various fractions of the training data can be used to improve the predictive accuracy of a classification algorithm. We present a method, based on this scrambling, for calculating the "noise sensitivity signature" of a learning algorithm. This signature can be used to indicate a good match between the complexity of the classifier and the complexity of the data. The use of noise sensitivity signatures is distinctly different from other schemes for avoiding overtraining, such as cross-validation, which uses only part of the training data, or various penalty functions, which are not data-adaptive. Noise sensitivity signature methods use all of the training data and are manifestly data-adaptive and nonparametric. They are well suited to situations with limited training data.

1 INTRODUCTION

A major problem for pattern recognition and classification algorithms that learn from a training set of examples is selecting the complexity of the model to be trained. How can one prevent an overparameterized algorithm from "memorizing" the training data?
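The scrambling step described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name `scramble_labels` and its parameters are assumptions for exposition. It reassigns a chosen fraction of the training labels to classes drawn uniformly at random, which is the operation underlying a noise sensitivity measurement.

```python
import random

def scramble_labels(labels, fraction, classes, rng=None):
    """Return a copy of `labels` with `fraction` of them randomly reassigned.

    Illustrative sketch only; names and signature are not from the paper.
    A reassigned label may coincide with the original by chance, so the
    effective noise level is slightly below `fraction` for few classes.
    """
    rng = rng or random.Random(0)
    out = list(labels)
    n_scramble = int(round(fraction * len(out)))
    # Pick which training examples to corrupt, then redraw their classes.
    for i in rng.sample(range(len(out)), n_scramble):
        out[i] = rng.choice(classes)
    return out

# Usage: corrupt 30% of a toy two-class label vector.
y = [0, 1] * 50
y_noisy = scramble_labels(y, 0.3, classes=[0, 1], rng=random.Random(42))
```

A noise sensitivity signature would then be obtained by retraining the classifier on `y_noisy` for a range of `fraction` values and recording how its behavior degrades with the injected label noise.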
Dec-31-1994