AITopics | transfer test

Large Language Models are Biased Reinforcement Learners

Hayes, William M.; Yax, Nicolas; Palminteri, Stefano

arXiv.org Artificial Intelligence | May-18-2024

In-context learning enables large language models (LLMs) to perform a variety of tasks, including learning to make reward-maximizing choices in simple bandit tasks. Given their potential use as (autonomous) decision-making agents, it is important to understand how these models perform such reinforcement learning (RL) tasks and the extent to which they are susceptible to biases. Motivated by the fact that, in humans, it has been widely documented that the value of an outcome depends on how it compares to other local outcomes, the present study focuses on whether similar value encoding biases apply to how LLMs encode rewarding outcomes. Results from experiments with multiple bandit tasks and models show that LLMs exhibit behavioral signatures of a relative value bias. Adding explicit outcome comparisons to the prompt produces opposing effects on performance, enhancing maximization in trained choice sets but impairing generalization to new choice sets. Computational cognitive modeling reveals that LLM behavior is well-described by a simple RL algorithm that incorporates relative values at the outcome encoding stage. Lastly, we present preliminary evidence that the observed biases are not limited to fine-tuned LLMs, and that relative value processing is detectable in the final hidden layer activations of a raw, pretrained model. These findings have important implications for the use of LLMs in decision-making applications.
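The abstract's idea of an RL algorithm that "incorporates relative values at the outcome encoding stage" can be illustrated with a minimal sketch. This is not the authors' model: the range normalization, the `weight` mixing parameter, the learning rate, and the two noise-free training contexts are all assumptions chosen for illustration. It shows how encoding outcomes relative to their local context can leave training choices correct while reversing the preference in a transfer test.

```python
def encode(reward, context_min, context_max, weight):
    """Blend an absolute reward with its range-normalized (relative) value.

    weight=0.0 keeps the absolute reward; weight=1.0 keeps only the
    relative value in [0, 1]. (Hypothetical encoding, for illustration.)
    """
    span = context_max - context_min
    relative = (reward - context_min) / span if span > 0 else 0.0
    return weight * relative + (1.0 - weight) * reward


def learn_values(contexts, weight, alpha=0.3, n_trials=50):
    """Delta-rule learning with full feedback over fixed, noise-free outcomes."""
    q = {}
    for options in contexts:
        lo, hi = min(options.values()), max(options.values())
        for name, reward in options.items():
            value = 0.0
            for _ in range(n_trials):
                # Standard prediction-error update on the encoded outcome.
                value += alpha * (encode(reward, lo, hi, weight) - value)
            q[name] = value
    return q


# Two hypothetical training contexts (option name -> payoff).
contexts = [
    {"A": 10.0, "B": 30.0},  # low-value context: B is the local best
    {"C": 40.0, "D": 90.0},  # high-value context: C is the local worst
]

absolute = learn_values(contexts, weight=0.0)
relative = learn_values(contexts, weight=1.0)

# Transfer test: pair options that were never presented together.
transfer_absolute = max(["B", "C"], key=absolute.get)
transfer_relative = max(["B", "C"], key=relative.get)

print("absolute learner picks:", transfer_absolute)
print("relative learner picks:", transfer_relative)
```

In this toy setup both learners rank options correctly within each training context, but only the absolute learner prefers the higher-paying option C when B and C are paired for the first time.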

comparison prompt, relative value, transfer test, (15 more...)

arXiv:2405.11422

Country: North America > United States > New York > Broome County > Binghamton (0.04)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Relative Value Biases in Large Language Models

Hayes, William M.; Yax, Nicolas; Palminteri, Stefano

arXiv.org Artificial Intelligence | Jan-25-2024

Studies of reinforcement learning in humans and animals have demonstrated a preference for options that yielded relatively better outcomes in the past, even when those options are associated with lower absolute reward. The present study tested whether large language models would exhibit a similar bias. We had gpt-4-1106-preview (GPT-4 Turbo) and Llama-2-70B make repeated choices between pairs of options with the goal of maximizing payoffs. A complete record of previous outcomes was included in each prompt. Both models exhibited relative value decision biases similar to those observed in humans and animals. Making relative comparisons among outcomes more explicit magnified the bias, whereas prompting the models to estimate expected outcomes caused the bias to disappear. These results have implications for the potential mechanisms that contribute to context-dependent choice in human agents.

language model, relative valuation, relative value bias, (13 more...)

arXiv:2401.14530

Country:

North America > United States > New York > Broome County > Binghamton (0.05)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Recurrent Neural Network for Word Identification from Continuous Phoneme Strings

Neural Information Processing Systems | Apr-6-2023, 19:32:05 GMT

A neural network architecture was designed for locating word boundaries and identifying words from phoneme sequences. This architecture was tested in three sets of studies. First, a highly redundant corpus with a restricted vocabulary was generated and the network was trained with a limited number of phonemic variations for the words in the corpus. Tests of network performance on a transfer set yielded a very low error rate. In a second study, a network was trained to identify words from expert transcriptions of speech.

continuous phoneme string, recurrent neural network, word identification, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

PNY LX3030 SSD review: Incredible durability for twice the price

PCWorld | Sep-28-2021, 10:45:00 GMT

The PNY LX3030 is marketed directly at Chia cryptocurrency plotting, a very high-bandwidth sustained write task. If your workload involves something similar, such as continuous large-scale backup, video encoding, or anything else that involves writing lots and lots of data, it might also be of interest. The LX3030 is the fastest PCIe 3.0-based sustained writer we've tested and its TBW (TeraBytes that can be Written) ratings are astounding: 27,000TBW per 1TB of NAND. Seagate's scorching fast FireCuda 530 is rated for 1,250TBW per terabyte, a lot of data by normal standards, but more than an order of magnitude short of the PNY's rated durability.
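To put the two endurance ratings side by side, here is a small arithmetic sketch. The TBW figures come from the review excerpt; the daily write volume is a hypothetical workload chosen only to make the comparison concrete, and the calculation ignores warranty terms and real-world wear behavior.

```python
import math

# Rated endurance per terabyte of capacity, as quoted in the review excerpt.
PNY_LX3030_TBW_PER_TB = 27_000
SEAGATE_FIRECUDA_530_TBW_PER_TB = 1_250


def years_of_writes(tbw, tb_written_per_day):
    """Years until the rated TBW is consumed at a constant daily write volume."""
    return tbw / (tb_written_per_day * 365)


# How many times larger the PNY rating is, and the same gap in powers of ten.
ratio = PNY_LX3030_TBW_PER_TB / SEAGATE_FIRECUDA_530_TBW_PER_TB
orders_of_magnitude = math.log10(ratio)

# Hypothetical sustained workload: 10 TB written per day to a 1TB drive.
DAILY_WRITES_TB = 10.0
pny_years = years_of_writes(PNY_LX3030_TBW_PER_TB, DAILY_WRITES_TB)
seagate_years = years_of_writes(SEAGATE_FIRECUDA_530_TBW_PER_TB, DAILY_WRITES_TB)

print(f"rating ratio: {ratio:.1f}x ({orders_of_magnitude:.2f} orders of magnitude)")
print(f"PNY LX3030 at {DAILY_WRITES_TB} TB/day: {pny_years:.1f} years")
print(f"FireCuda 530 at {DAILY_WRITES_TB} TB/day: {seagate_years:.1f} years")
```

Because both ratings are expressed per terabyte of capacity, the ratio between them is the same at any drive size; only the absolute lifetimes depend on the assumed workload.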

durability, lx3030, tbw rating, (16 more...)

Industry:

Transportation > Passenger (0.40)
Transportation > Ground > Road (0.40)
Automobiles & Trucks > Manufacturer (0.40)
Information Technology > Hardware (0.36)

Technology:

Information Technology > Hardware (0.71)
Information Technology > Artificial Intelligence (0.49)
