The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization

Borec, Luka, Sadler, Philipp, Schlangen, David

arXiv.org Artificial Intelligence

This work analyses the text memorization behavior of large language models (LLMs) when subjected to nucleus sampling. Stochastic decoding methods like nucleus sampling are typically applied to overcome issues such as monotonous and repetitive text generation, which are often observed with maximization-based decoding techniques. We hypothesize that nucleus sampling might also reduce the occurrence of memorization patterns, because it could lead to the selection of tokens outside the memorized sequence. To test this hypothesis, we create a diagnostic dataset with a known distribution of duplicates that gives us some control over the likelihood of memorization of certain parts of the training data. Our analysis of two GPT-Neo models fine-tuned on this dataset shows, interestingly, that (i) increasing the nucleus size reduces memorization only modestly, and (ii) even when models do not engage in "hard" memorization -- a verbatim reproduction of training samples -- they may still display "soft" memorization, whereby they generate outputs that echo the training data without a complete one-to-one resemblance.
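For readers unfamiliar with the decoding method under study, the following is a minimal sketch of nucleus (top-p) sampling over a single next-token distribution: keep the smallest set of tokens whose cumulative probability reaches p, renormalize, and sample from that truncated set. This illustrates the general technique only, not the authors' experimental setup; the function name and example distribution are illustrative.

```python
import random

def nucleus_sample(probs, p=0.9, rng=random):
    """Sample a token index from the smallest set of highest-probability
    tokens whose cumulative mass is at least p (nucleus / top-p sampling)."""
    # Rank token indices by probability, highest first.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    nucleus, total = [], 0.0
    for i in order:
        nucleus.append(i)
        total += probs[i]
        if total >= p:  # smallest prefix of the ranking that reaches mass p
            break
    # Renormalize over the surviving tokens and sample.
    weights = [probs[i] / total for i in nucleus]
    return rng.choices(nucleus, weights=weights, k=1)[0]

# Hypothetical next-token distribution over a 4-token vocabulary.
# With p=0.5, only the two most likely tokens (indices 0 and 1) survive
# truncation, so the sample is always drawn from {0, 1}.
probs = [0.4, 0.3, 0.2, 0.1]
picked = nucleus_sample(probs, p=0.5)
```

The hypothesis in the abstract corresponds to the truncation step: if the memorized continuation's token falls outside the nucleus, sampling is forced off the memorized sequence, whereas a larger p admits more of the tail and weakens that effect.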


The Unreasonable Ineffectiveness of Deep Learning in NLU

@machinelearnbot

I often get pitched a supposedly superior deep learning solution for Natural Language Understanding (NLU). After all, deep learning is the disruptive new force in AI. A better NLU AI promises many useful advancements, ranging from smarter chatbots and virtual assistants to news categorization, with the ultimate promise of better language comprehension. Let's assume this superior deep learning (DL) "product" is called "(Dot)AI". Their pitch deck will invariably have a bar chart that looks something like this -- the claim being that the new DL topic classifier/tagger of (Dot)AI is better than state-of-the-art methods.


The Unreasonable Ineffectiveness of Machine Learning in Computer Systems Research

#artificialintelligence

In 1960, the physicist Eugene Wigner wrote a famous essay titled "The Unreasonable Effectiveness of Mathematics in the Natural Sciences" in which he explored the question of why mathematics is so remarkably useful in the natural sciences. A contemporary example of such "unreasonable effectiveness" is the success that machine learning has had in transforming many disciplines in the past decade. Particularly impressive is the progress in autonomous vehicles. In the 2004 DARPA Grand Challenge for autonomous vehicles, which popularized the idea of driverless cars, none of the vehicles was able to complete a relatively simple route through the Mojave Desert, and I thought it unlikely that I would see driverless cars operating in urban environments in my lifetime. Since that time, progress in this area has been phenomenal, thanks to rapid advances in using machine learning for sensing and navigation (and in building low-cost sensors and controls).