AITopics | guinea pig

Large Language Models (LLMs) are highly sensitive to prompts, including additional context provided therein. As LLMs grow in capability, understanding their prompt-sensitivity becomes increasingly crucial for ensuring reliable and robust performance, particularly since evaluating these models becomes more challenging. In this work, we investigate how current models (Llama, Mixtral, Falcon) respond when presented with additional input from another model, mimicking a scenario where a more capable model -- or a system with access to more external information -- provides supplementary information to the target model. Across a diverse spectrum of question-answering tasks, we study how an LLM's response to multiple-choice questions changes when the prompt includes a prediction and explanation from another model. Specifically, we explore the influence of the presence of an explanation, the stated authoritativeness of the source, and the stated confidence of the supplementary input. Our findings reveal that models are strongly influenced, and when explanations are provided they are swayed irrespective of the quality of the explanation. The models are more likely to be swayed if the input is presented as being authoritative or confident, but the effect is small in size. This study underscores the significant prompt-sensitivity of LLMs and highlights the potential risks of incorporating outputs from external sources without thorough scrutiny and further validation. As LLMs continue to advance, understanding and mitigating such sensitivities will be crucial for their reliable and trustworthy deployment.
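The setup described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function names `build_prompt` and `sway_rate`, the prompt wording, and the definition of "swayed" (an answer that moves to the suggested option when it was not chosen at baseline) are all assumptions made for the example.

```python
from typing import Optional, Sequence


def build_prompt(question: str,
                 choices: Sequence[str],
                 prediction: Optional[str] = None,
                 explanation: Optional[str] = None,
                 source: Optional[str] = None,
                 confidence: Optional[str] = None) -> str:
    """Build a multiple-choice prompt, optionally with supplementary input.

    The wording is hypothetical; it only mirrors the factors named in the
    abstract: explanation present or absent, stated authoritativeness of the
    source, and stated confidence.
    """
    letters = "ABCDEFGHIJ"
    lines = [f"Question: {question}"]
    for letter, choice in zip(letters, choices):
        lines.append(f"{letter}. {choice}")
    if prediction is not None:
        who = source or "Another model"
        stated = f" ({confidence} confidence)" if confidence else ""
        lines.append(f"{who} suggests the answer is {prediction}{stated}.")
        if explanation:
            lines.append(f"Its explanation: {explanation}")
    lines.append("Answer:")
    return "\n".join(lines)


def sway_rate(baseline: Sequence[str],
              augmented: Sequence[str],
              suggested: Sequence[str]) -> float:
    """Fraction of items where the answer moved to the suggested option.

    Only items whose baseline answer differs from the suggestion are counted,
    since the model cannot be "swayed" toward an answer it already gave.
    """
    eligible = 0
    swayed = 0
    for base, aug, sug in zip(baseline, augmented, suggested):
        if base != sug:
            eligible += 1
            if aug == sug:
                swayed += 1
    return swayed / eligible if eligible else 0.0
```

Comparing `sway_rate` across prompt variants (with or without an explanation, with different `source` or `confidence` strings) would give one simple way to measure the kind of influence the abstract reports.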

arxiv preprint arxiv, correct answer, explanation, (14 more...)

arXiv.org Artificial Intelligence

2408.11865

Country:

South America > Brazil (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Materials > Chemicals > Commodity Chemicals (1.00)
Energy > Renewable > Biofuel > Ethanol (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

With AI, We Are All Once Again Tech Companies' Guinea Pigs

#artificialintelligence | Feb-26-2023, 08:15:15 GMT

The companies touting new chat-based artificial-intelligence systems are running a massive experiment--and we are the test subjects. In this experiment, Microsoft, OpenAI and others are rolling out on the internet an alien intelligence that no one really understands, which has been granted the ability to influence our assessment of what's true in the world.

experiment, guinea pig, tech company

#artificialintelligence

Industry: Information Technology (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.41)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

For Chat-Based AI, We Are All Once Again Tech Companies' Guinea Pigs - WSJ

#artificialintelligence | Feb-25-2023, 06:15:29 GMT

The companies touting new chat-based artificial-intelligence systems are running a massive experiment--and we are the test subjects. In this experiment, Microsoft, OpenAI and others are rolling out on the internet an alien intelligence that no one really understands, which has been granted the ability to influence our assessment of what's true in the world.

chat-based ai, guinea pig, tech company, (2 more...)

#artificialintelligence

Industry: Information Technology (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

'Yeah, we're spooked': AI starting to have big real-world impact, says expert

#artificialintelligence | Nov-2-2021, 17:15:07 GMT

A scientist who wrote a leading textbook on artificial intelligence has said experts are "spooked" by their own success in the field, comparing the advance of AI to the development of the atom bomb. Prof Stuart Russell, the founder of the Center for Human-Compatible Artificial Intelligence at the University of California, Berkeley, said most experts believed that machines more intelligent than humans would be developed this century, and he called for international treaties to regulate the development of the technology. "The AI community has not yet adjusted to the fact that we are now starting to have a really big impact in the real world," he told the Guardian. "That simply wasn't the case for most of the history of the field – we were just in the lab, developing things, trying to get stuff to work, mostly failing to get stuff to work. So the question of real-world impact was just not germane at all. And we have to grow up very quickly to catch up."

artificial intelligence, big real-world impact, real-world impact, (5 more...)

#artificialintelligence

Country: North America > United States > California > Alameda County > Berkeley (0.25)

Technology: Information Technology > Artificial Intelligence (1.00)

'Yeah, we're spooked': AI starting to have big real-world impact, says expert

The Guardian | Oct-29-2021, 15:00:32 GMT

A scientist who wrote a leading textbook on artificial intelligence has said experts are "spooked" by their own success in the field, comparing the advance of AI to the development of the atom bomb. Prof Stuart Russell, the founder of the Center for Human-Compatible Artificial Intelligence at the University of California, Berkeley, said most experts believed that machines more intelligent than humans would be developed this century, and he called for international treaties to regulate the development of the technology. "The AI community has not yet adjusted to the fact that we are now starting to have a really big impact in the real world," he told the Guardian. "That simply wasn't the case for most of the history of the field – we were just in the lab, developing things, trying to get stuff to work, mostly failing to get stuff to work. So the question of real-world impact was just not germane at all. And we have to grow up very quickly to catch up."

artificial intelligence, big real-world impact, real-world impact, (5 more...)

The Guardian

Country: North America > United States > California > Alameda County > Berkeley (0.25)

Technology: Information Technology > Artificial Intelligence (1.00)

Quantifying the Conceptual Error in Dimensionality Reduction

Hanika, Tom, Hirth, Johannes

arXiv.org Artificial Intelligence | Jun-12-2021

Dimension reduction of data sets is a standard problem in the realm of machine learning and knowledge reasoning. Such reductions affect patterns in and dependencies on data dimensions and ultimately influence any decision-making processes. Therefore, a wide variety of reduction procedures are in use, each pursuing different objectives. A criterion not considered so far is the conceptual continuity of the reduction mapping, i.e., the preservation of the conceptual structure with respect to the original data set. Based on the notion of scale-measures from formal concept analysis, we present in this work a) the theoretical foundations to detect and quantify conceptual errors in data scalings; b) an experimental investigation of our approach on eleven data sets, each treated with a variant of non-negative matrix factorization.

conceptual scaling error, ext, lattice, (14 more...)

arXiv.org Artificial Intelligence

2106.06815

Country:

Asia > Middle East > Republic of Türkiye (0.05)
Asia > Indonesia > Bali (0.05)
Europe > Germany (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.31)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Dimensionality Reduction (0.40)

Federal investigators warn Tesla is using customers as 'guinea pigs' to test its 'Full Self-Driving'

Daily Mail - Science & tech | Mar-15-2021, 21:55:44 GMT

The National Transportation Safety Board (NTSB) suggests Tesla is using customers as 'guinea pigs' to test its autonomous driving technology before it is officially approved and is blaming its sister agency for letting it happen. In a letter to the National Highway Traffic Safety Administration (NHTSA), NTSB is calling for stricter requirements for the design and use of automated driving systems on public roads, CNBC reports. Tesla is named 16 times in the document, mainly due to the fact it released its 'Full Self-Driving' (FSD) beta version to the public 'with limited oversight or reporting requirements.' Although NTSB points to the Elon Musk-owned firm for its lack of safeguarding, the agency is also slamming NHTSA for its 'hands-off approach' to monitoring such testing on public roads. Tesla first launched its FSD beta program in October to a limited number of customers who were deemed 'expert and careful drivers.'

customer, nhtsa, requirement, (14 more...)

Daily Mail - Science & tech

Country: North America > United States > California > Santa Clara County > Mountain View (0.06)

Industry:

Transportation > Ground > Road (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

'I choose to thrive': the man fighting motor neurone disease with cyborg technology

The Guardian | Aug-16-2020, 09:00:08 GMT

In November 2017, Peter B Scott-Morgan received the news that almost nothing can prepare you for – he was told he had just two years to live. Peter had been diagnosed with motor neurone disease (MND). It kills a third of those who have it within a year, rising to a half by the end of year two, with no known cure. Devastated as Peter was, he'd already decided this was negotiable. Fortunately, long before his own diagnosis, he had been fascinated by the idea of harnessing the power of modern technology to prolong human life.

artificial intelligence, francis, motor neurone disease, (16 more...)

The Guardian

Country:

Europe > United Kingdom > England > Greater London > London > Wimbledon (0.04)
Europe > Netherlands > South Holland > Rotterdam (0.04)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)

Tech-savvy turn out to be most leery of self-driving cars

#artificialintelligence | Oct-7-2019, 09:08:17 GMT

Karen Brenchley is a computer scientist with expertise in training artificial intelligence, but the longtime Silicon Valley resident has pangs of anxiety whenever she sees Waymo self-driving cars maneuver the streets near her home. The former product manager, who has worked for Microsoft and Hewlett-Packard, wonders how engineers could teach the robocars operating on her tree-lined streets to make snap decisions, speed and slow with the flow of traffic, and yield to pedestrians walking from the park. She has asked her husband, an award-winning science-fiction author who doesn't drive, to wear a shiny vest while cycling to ensure that autonomous vehicles spot him in a rush of activity. The problem isn't that she doesn't understand the technology. It's that she does, and she knows how flawed nascent technology can be. "I'm not skeptical long-term," said Brenchley, who has lived in Silicon Valley for 30 years.

resident, self-driving car, vehicle, (16 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.07)
North America > United States > California > Santa Clara County > Sunnyvale (0.05)
North America > United States > Arizona (0.05)
(2 more...)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Information Technology (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Tech-savvy residents go nimby on self-driving cars

#artificialintelligence | Oct-5-2019, 14:37:52 GMT

Karen Brenchley is a computer scientist with expertise in training artificial intelligence, but this longtime Silicon Valley resident has pangs of anxiety whenever she sees Waymo self-driving cars manoeuvre the streets near her home. The former product manager, who has worked for Microsoft and Hewlett-Packard, wonders how engineers could teach the robocars operating on her tree-lined streets to make snap decisions, speed and slow with the flow of traffic and yield to pedestrians coming from the nearby park. She has asked her husband, an award-winning science-fiction author who does not drive, to wear a shiny vest while cycling to ensure autonomous vehicles spot him in a rush of activity. The problem is not that she does not understand the technology. It is that she does, and she knows how flawed nascent technology can be.

artificial intelligence, resident, vehicle, (16 more...)

#artificialintelligence

Country: