underfit
Token-Level Fitting Issues of Seq2seq Models
Bao, Guangsheng, Teng, Zhiyang, Zhang, Yue
Sequence-to-sequence (seq2seq) models have been widely used for natural language processing, computer vision, and other deep learning tasks. We find that seq2seq models trained with early stopping suffer from issues at the token level. In particular, while some tokens in the vocabulary demonstrate overfitting, others underfit when training is stopped. Experiments show that the phenomena are pervasive across different models, even in fine-tuned large pretrained models. We identify three major factors that influence token-level fitting: token frequency, part of speech, and prediction discrepancy. Further, we find that external factors such as language, model size, domain, data scale, and pretraining can also influence the fitting of tokens.
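As a concrete illustration of the idea (not the paper's own measurement protocol), one way to probe token-level fitting is to compare each token type's average loss on training versus validation data: tokens with high loss on both splits look underfit, while tokens with a large validation-minus-training gap look overfit. In the sketch below, `model`, `train_loader`, `val_loader`, and `vocab_size` are hypothetical placeholders.

```python
# Hypothetical sketch: per-token-type train/validation loss for a seq2seq model.
# `model`, `train_loader`, and `val_loader` are assumed to exist already; this
# is an illustration, not the paper's actual protocol.
import torch
import torch.nn.functional as F

def per_token_loss(model, loader, vocab_size):
    """Average cross-entropy per token *type* over a data loader."""
    total = torch.zeros(vocab_size)
    count = torch.zeros(vocab_size)
    model.eval()
    with torch.no_grad():
        for src, tgt in loader:
            logits = model(src, tgt[:, :-1])  # teacher forcing (placeholder API)
            labels = tgt[:, 1:].reshape(-1)
            loss = F.cross_entropy(
                logits.reshape(-1, vocab_size), labels, reduction="none"
            )
            total.index_add_(0, labels, loss)
            count.index_add_(0, labels, torch.ones_like(loss))
    return total / count.clamp(min=1)

# High loss on both splits -> underfit token; large val-train gap -> overfit token.
# gap = per_token_loss(model, val_loader, V) - per_token_loss(model, train_loader, V)
```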
Bias-Variance Tradeoff
This article was published as part of the Data Science Blogathon. In this post, we will explain the bias-variance tradeoff in machine learning and how to get the most out of it, following up with some illustrative examples and a practical implementation at the end. First things first, let us learn what bias, variance, and the trade-off actually mean in practice, without resorting to bookish definitions. Bias is the part of the generalization error that arises from wrong assumptions made during development, e.g., assuming that the data is linear when it is actually quadratic.
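A minimal sketch of exactly that wrong assumption, fitting a straight line to quadratic data (values and seed are illustrative only):

```python
# Bias from a wrong assumption: fit a line to data that is actually quadratic.
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3, 3, 200)
y = x**2 + rng.normal(scale=0.3, size=x.shape)   # true relation is quadratic

# Wrong assumption: the data is linear (degree-1 fit).
linear_fit = np.polyval(np.polyfit(x, y, deg=1), x)
# Matching assumption: a degree-2 fit.
quad_fit = np.polyval(np.polyfit(x, y, deg=2), x)

print("linear MSE:", np.mean((y - linear_fit) ** 2))   # large: high bias, underfits
print("quadratic MSE:", np.mean((y - quad_fit) ** 2))  # close to the noise variance
```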
Neural Networks and Self-Driving Cars without Any Complex Mathematics
My strategy in this article is to equip you with the basic steps that go into building a neural network to solve a task we are interested in. Rest assured, there will be no mathematics involved! Deep learning requires us to train a neural network, an algorithm that learns from data (usually, the more data, the better the learning). The principle is to provide lots of labelled data. Hey, wait a minute! I will explain what labelled data means, but let me first familiarise you with the analogy we are going to use to demystify the working of neural networks. Here, we will consider the scenario of a math class in school. I know math is not very exciting for most people, which is why I chose the math class to make it more interesting.
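To make "labelled data" concrete before the analogy, here is a made-up toy example (not from the article): each input comes paired with the answer we want the network to learn to produce.

```python
# Toy illustration of labelled data: each input is paired with its label.
# The features and labels here are invented purely for demonstration.
labelled_data = [
    # (input: numeric features of an image, label: what the image shows)
    ([0.9, 0.1, 0.3], "stop sign"),
    ([0.2, 0.8, 0.7], "green light"),
    ([0.9, 0.2, 0.2], "stop sign"),
]
for features, label in labelled_data:
    print(f"input={features} -> label={label!r}")
```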
Undecidability of Underfitting in Learning Algorithms
Sehra, Sonia, Flores, David, Montanez, George D.
Using recent machine learning results that present an information-theoretic perspective on underfitting and overfitting, we prove that deciding whether an encodable learning algorithm will always underfit a dataset, even if given unlimited training time, is undecidable. We discuss the importance of this result and potential topics for further research, including information-theoretic and probabilistic strategies for bounding learning algorithm fit.
Is your model overfitting? Or maybe underfitting? An example using a neural network in Python
Underfitting means that our ML model can neither model the training data nor generalize to new, unseen data. A model that underfits the data will have poor performance on the training data itself. For example, using a linear model to capture non-linear trends in the data would underfit. The textbook sign of underfitting is that the model's error is very high on both the training and test sets (i.e., during training and testing). Naturally, there is a trade-off between overfitting and underfitting.
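A minimal sketch of that textbook diagnosis, using a small scikit-learn neural network on synthetic non-linear data (all names, sizes, and hyperparameters are illustrative, not the post's actual example):

```python
# Compare training and test error: high on both splits suggests underfitting;
# low training error with high test error suggests overfitting.
import numpy as np
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(500, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=500)  # non-linear target

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for name, model in [
    ("underfit (tiny net)", MLPRegressor(hidden_layer_sizes=(1,), max_iter=50, random_state=0)),
    ("better fit", MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=2000, random_state=0)),
]:
    model.fit(X_tr, y_tr)
    print(name,
          "train MSE:", mean_squared_error(y_tr, model.predict(X_tr)),
          "test MSE:", mean_squared_error(y_te, model.predict(X_te)))
```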
A Non-Parametric Test to Detect Data-Copying in Generative Models
Meehan, Casey, Chaudhuri, Kamalika, Dasgupta, Sanjoy
Detecting overfitting in generative models is an important challenge in machine learning. In this work, we formalize a form of overfitting that we call data-copying, where the generative model memorizes and outputs training samples or small variations thereof. We provide a three-sample non-parametric test for detecting data-copying that uses the training set, a separate sample from the target distribution, and a generated sample from the model, and study the performance of our test on several canonical models and datasets. For code & examples, visit https://github.com/casey-meehan/data-copying
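The repository above contains the authors' implementation. As a rough sketch of the three-sample idea only (not the paper's exact statistic), one can compare how close generated samples sit to the training set versus how close an independent held-out target sample sits to it, using a standard non-parametric rank test:

```python
# Rough three-sample data-copying check, inspired by the abstract (not the
# paper's exact test). If generated samples sit abnormally close to training
# points compared with genuine held-out samples, the model may be copying data.
import numpy as np
from scipy.stats import mannwhitneyu
from sklearn.neighbors import NearestNeighbors

def copying_check(train, held_out, generated):
    nn = NearestNeighbors(n_neighbors=1).fit(train)
    d_held, _ = nn.kneighbors(held_out)   # distances from target sample to train
    d_gen, _ = nn.kneighbors(generated)   # distances from generated sample to train
    # One-sided test: are generated-to-train distances systematically smaller?
    _, p = mannwhitneyu(d_gen.ravel(), d_held.ravel(), alternative="less")
    return p  # small p-value suggests data-copying

rng = np.random.default_rng(0)
train = rng.normal(size=(500, 2))
held_out = rng.normal(size=(200, 2))
# A "model" that memorizes: training points plus tiny noise.
memorizer = train[rng.integers(0, 500, size=200)] + rng.normal(scale=0.01, size=(200, 2))
print("p-value for a memorizing 'model':", copying_check(train, held_out, memorizer))
```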