AITopics | bad training data

Collaborating Authors

bad training data

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Use of Bad Training Data for Better Predictions

Neural Information Processing SystemsApr-6-2023, 18:48:15 GMT

We show how randomly scrambling the output classes of various fractions of the training data may be used to improve predictive accuracy of a classification algorithm. We present a method for calculating the "noise sensitivity signature" of a learning algorithm which is based on scrambling the output classes. This signature can be used to indicate a good match between the complexity of the classifier and the complexity of the data. Use of noise sensitivity signatures is distinctly different from other schemes to avoid over(cid:173) training, such as cross-validation, which uses only part of the train(cid:173) ing data, or various penalty functions, which are not data-adaptive. Noise sensitivity signature methods use all of the training data and are manifestly data-adaptive and non-parametric.

bad training data, better prediction, noise sensitivity signature, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.66)

Add feedback

'Racist' AI art warns against bad training data

#artificialintelligenceSep-17-2019, 17:00:31 GMT

An artificial-intelligence art project has been criticised for using racist and sexist tags to classify its users. When they share a selfie with ImageNet Roulette, the web app matches it to the ones it most closely resembles from an enormous library of profile photos. It then reveals the most popular tag, assigned to the matching pictures by human workers using data set WordNet. These include racial slurs, "first offender", "rape suspect", "spree killer", "newsreader", and "Batman". Those responsible for assigning the tags to the library pictures were recruited via a service offered by Amazon, called Mechanical Turk, which pays workers around the world pennies to perform small, monotonous tasks.

artificial intelligence, bad training data, machine learning, (4 more...)

#artificialintelligence

Country: Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.29)

Industry:

Law > Civil Rights & Constitutional Law (0.65)
Law > Criminal Law (0.63)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Use of Bad Training Data for Better Predictions

Grossman, Tal, Lapedes, Alan

Neural Information Processing SystemsDec-31-1994

We show how randomly scrambling the output classes of various fractions of the training data may be used to improve predictive accuracy of a classification algorithm. We present a method for calculating the "noise sensitivity signature" of a learning algorithm which is based on scrambling the output classes. This signature can be used to indicate a good match between the complexity of the classifier and the complexity of the data. Use of noise sensitivity signatures is distinctly different from other schemes to avoid overtraining, such as cross-validation, which uses only part of the training data, or various penalty functions, which are not data-adaptive. Noise sensitivity signature methods use all of the training data and are manifestly data-adaptive and nonparametric. They are well suited for situations with limited training data. 1 INTRODUCTION A major problem of pattern recognition and classification algorithms that learn from a training set of examples is to select the complexity of the model to be trained. How is it possible to avoid an overparameterized algorithm from "memorizing" the training data?

algorithm, architecture, classifier, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
North America > United States > New Mexico > Santa Fe County > Santa Fe (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Use of Bad Training Data for Better Predictions

Grossman, Tal, Lapedes, Alan

Neural Information Processing SystemsDec-31-1994

We show how randomly scrambling the output classes of various fractions of the training data may be used to improve predictive accuracy of a classification algorithm. We present a method for calculating the "noise sensitivity signature" of a learning algorithm which is based on scrambling the output classes. This signature can be used to indicate a good match between the complexity of the classifier and the complexity of the data. Use of noise sensitivity signatures is distinctly different from other schemes to avoid overtraining, such as cross-validation, which uses only part of the training data, or various penalty functions, which are not data-adaptive. Noise sensitivity signature methods use all of the training data and are manifestly data-adaptive and nonparametric. They are well suited for situations with limited training data. 1 INTRODUCTION A major problem of pattern recognition and classification algorithms that learn from a training set of examples is to select the complexity of the model to be trained. How is it possible to avoid an overparameterized algorithm from "memorizing" the training data?

algorithm, architecture, classifier, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
North America > United States > New Mexico > Santa Fe County > Santa Fe (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Use of Bad Training Data for Better Predictions

Grossman, Tal, Lapedes, Alan

Neural Information Processing SystemsDec-31-1994

artificial intelligence, classifier, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > New Mexico (0.29)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback