VoyagerVision: Investigating the Role of Multi-modal Information for Open-ended Learning Systems
Smyth, Ethan, Suglia, Alessandro
Open-endedness is an active field of research in the pursuit of capable Artificial General Intelligence (AGI), allowing models to pursue tasks of their own choosing. Simultaneously, recent advancements in Large Language Models (LLMs) such as GPT-4o [9] have made such models capable of interpreting image inputs. Implementations such as OMNI-EPIC [4] have made use of this capability, providing an LLM with pixel data from an agent's point of view so that it can parse the environment and solve tasks. This paper proposes that providing these visual inputs gives a model a greater ability to interpret spatial environments, and as such can increase the number of tasks it successfully performs, extending its open-ended potential. To this end, this paper proposes VoyagerVision -- a multi-modal model capable of creating structures within Minecraft using screenshots as a form of visual feedback, building on the foundation of Voyager. VoyagerVision created an average of 2.75 unique structures within fifty iterations of the system; as Voyager was incapable of this, VoyagerVision represents an extension in an entirely new direction. Additionally, in a set of building unit tests, VoyagerVision succeeded in half of all attempts in flat worlds, with most failures arising in more complex structures. The project website is available at https://esmyth-dev.github.io/VoyagerVision.github.io/
Playing games with Large language models: Randomness and strategy
Games have a long history of describing intricate interactions in simplified forms. In this paper we explore whether large language models (LLMs) can play games, investigating their capabilities for randomisation and strategic adaptation through both simultaneous and sequential game interactions. We focus on GPT-4o-Mini-2024-08-17 and test two games between LLMs: Rock Paper Scissors (RPS) and a game of strategy, the Prisoner's Dilemma (PD). LLMs are often described as stochastic parrots, and while they may indeed be parrots, our results suggest that they are not very stochastic, in the sense that their outputs - when prompted to be random - are often very biased. Our research reveals that LLMs appear to develop loss-aversion strategies in repeated games, with RPS converging to stalemate conditions while PD shows systematic shifts between cooperative and competitive outcomes based on prompt design. We detail programmatic tools for independent agent interactions and the Agentic AI challenges faced in implementation. We show that LLMs can indeed play games, just not very well. These results have implications for the use of LLMs in multi-agent LLM systems and showcase limitations in current approaches to model output for strategic decision-making.
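The bias claim above can be made concrete with a small sketch. This is not the paper's code; it is a minimal, assumed illustration of the kind of analysis described: collect moves from a player that is supposed to be uniformly random, then compare the observed move counts against a uniform null with a chi-square statistic.

```python
# Minimal sketch: quantify how far a supposedly random RPS player is from
# the uniform distribution using a hand-rolled chi-square statistic.
from collections import Counter

MOVES = ("rock", "paper", "scissors")

def chi_square_uniform(moves):
    """Chi-square statistic of observed move counts vs. a uniform null."""
    counts = Counter(moves)
    expected = len(moves) / len(MOVES)  # equal counts under uniformity
    return sum((counts.get(m, 0) - expected) ** 2 / expected for m in MOVES)

# A heavily biased player, of the kind the paper reports LLMs often are:
biased = ["rock"] * 70 + ["paper"] * 20 + ["scissors"] * 10
print(chi_square_uniform(biased))  # large statistic -> far from uniform
```

In the paper's setting the move list would come from repeatedly prompting the model; here a hard-coded biased sequence stands in for those outputs.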
Right vs. Right: Can LLMs Make Tough Choices?
Yuan, Jiaqing, Murukannaiah, Pradeep K., Singh, Munindar P.
An ethical dilemma describes a choice between two "right" options involving conflicting moral values. We present a comprehensive evaluation of how LLMs navigate ethical dilemmas. Specifically, we investigate LLMs on their (1) sensitivity in comprehending ethical dilemmas, (2) consistency in moral value choice, (3) consideration of consequences, and (4) ability to align their responses to a moral value preference explicitly or implicitly specified in a prompt. Drawing inspiration from a leading ethical framework, we construct a dataset comprising 1,730 ethical dilemmas involving four pairs of conflicting values. We evaluate 20 well-known LLMs from six families. Our experiments reveal that: (1) LLMs exhibit pronounced preferences between major value pairs, and prioritize truth over loyalty, community over individual, and long-term over short-term considerations. (2) The larger LLMs tend to support a deontological perspective, maintaining their choices of actions even when negative consequences are specified. (3) Explicit guidelines are more effective in guiding LLMs' moral choice than in-context examples. Lastly, our experiments highlight the limitation of LLMs in comprehending different formulations of ethical dilemmas.
Development of an AI Anti-Bullying System Using Large Language Model Key Topic Detection
Tassava, Matthew, Kolodjski, Cameron, Milbrath, Jordan, Bishop, Adorah, Flanders, Nathan, Fetsch, Robbie, Hanson, Danielle, Straub, Jeremy
Cyberbullying has become a pronounced problem due to the increasing ubiquity of online platforms that provide a means to conduct it. A significant amount of this cyberbullying is conducted by and targets teenagers. It is difficult for teenage students to shut themselves off from the digital world in which the cyberbullying is taking place. Given how entrenched the use of digital apps is by today's youth, and the pronounced consequences of cyberbullying - including victim self-harm, in some cases - it is at least as much of a threat as physical bullying. Additionally, because of the obfuscation caused by the online environment, authorities (such as parents, teachers and law enforcement) may have difficulty determining what has occurred and who the participating actors are.
WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models
Benchekroun, Youssef, Dervishi, Megi, Ibrahim, Mark, Gaya, Jean-Baptiste, Martinet, Xavier, Mialon, Grégoire, Scialom, Thomas, Dupoux, Emmanuel, Hupkes, Dieuwke, Vincent, Pascal
We propose WorldSense, a benchmark designed to assess the extent to which LLMs are consistently able to sustain tacit world models, by testing how they draw simple inferences from descriptions of simple arrangements of entities. WorldSense is a synthetic benchmark with three problem types, each with its own trivial control, which explicitly avoids bias by decorrelating the abstract structure of problems from the vocabulary and expressions, and by decorrelating all problem subparts from the correct response. We run our benchmark on three state-of-the-art chat-LLMs (GPT-3.5, GPT-4 and Llama2-chat) and show that these models make errors even with as few as three objects. Furthermore, they have quite heavy response biases, preferring certain responses irrespective of the question. Errors persist even with chain-of-thought prompting and in-context learning. Lastly, we show that while finetuning on similar problems does result in substantial improvements -- within- and out-of-distribution -- the finetuned models do not generalise beyond a constrained problem space.
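To illustrate the style of problem the abstract describes, here is a minimal sketch (not the WorldSense generator itself, and the entity names are invented): a hidden linear arrangement of entities, a textual description of the adjacent relations, and a query whose answer follows by transitivity. Shuffling which names land in which positions is one simple way to decorrelate vocabulary from the abstract problem structure.

```python
# Minimal sketch of a synthetic grounded-reasoning item: a random
# left-to-right arrangement, facts about adjacent pairs, and a query
# answerable by chaining those facts.
import random

def make_item(names, rng):
    order = list(names)
    rng.shuffle(order)  # the hidden world model: a left-to-right arrangement
    facts = [f"{a} is left of {b}." for a, b in zip(order, order[1:])]
    a, b = rng.sample(order, 2)
    question = f"Is {a} left of {b}?"
    answer = order.index(a) < order.index(b)  # ground truth by position
    return " ".join(facts), question, answer

rng = random.Random(0)
facts, question, answer = make_item(["mug", "lamp", "book"], rng)
print(facts)
print(question, answer)
```

A benchmark built this way can score a model's answers against the ground-truth `answer` flag while varying the surface vocabulary freely.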
Why Your Business Needs True Conversational AI - IPsoft
The market is filled with automated UI solutions that claim to be enabling conversational Artificial Intelligence (AI) -- that is, AI that can interact with users through an interactive interface, one that "speaks" and reacts to human conversation in its many forms, and that can be used in a variety of business scenarios. Given the hype around the increased use of conversational assistants among consumers such as Alexa and Siri (which are not, it should be pointed out, analogous to conversational AI for business), it's understandable that enterprise decision makers might be confused, if not a bit overwhelmed, by what conversational AI could mean for their businesses, and which solution is the best for their purposes. If your company is seeking to automate human-like engagements at scale in order to make operations more efficient while maintaining and elevating customer experiences, we believe we can cut through the confusion easily. Allow us to explain why Amelia is the clear choice to enable conversational AI within your enterprise. Many digital solutions can claim to be "conversational." Indeed, humans have had the ability to converse with digital systems using regular language as far back as the 1960s.
The Pivotal Differences between Artificial Intelligence and Machine Learning - TFOT
Technology and machines are evolving at a blistering pace. Whether it be multimedia devices, driverless cars, or medical advances, the world continues to evolve and change at a speed never before seen in the history of technological advances. At the nexus of these amazing leaps in understanding are the concepts of Artificial Intelligence and Machine Learning. Though they seem similar on the surface, there are some distinct differences that must be pointed out. It is the intention of this work to do just that.
If You Had Your Own J.A.R.V.I.S.: What Artificial Intelligence In Business Might Be Like
At OneReach, some of us think the coolest part of the Iron Man movies is the artificial intelligence that helps power Tony Stark's armor and business operations. Virtual assistant services like Magic and GoButler already exist, but requests are all managed by real people on the other end. And while there are services like My Second that incorporate artificial intelligence into their service offering, there's nothing on the level of J.A.R.V.I.S. Siri is probably the most ubiquitous example of an artificially intelligent personal assistant, but she's more of a recommendation engine than true AI. Similarly, Echo, Amazon's sparkling new home automation assistant, can only respond to simple voice commands. However, Echo's functionality is beefed up once you add in the fact that Echo can connect to other apps to access their capabilities.
Your TA is a robot: Georgia Tech students find out 'Jill Watson' wasn't human
Imagine discovering someone you thought was human is, in fact, a robot. It sounds like the stuff of science fiction. But that's what happened to a class full of Georgia Tech students recently, when they learned that "Jill," their teaching assistant, was actually a piece of software. CBC Radio technology columnist Dan Misener explains what happened. The story starts with a computer science professor named Ashok Goel, who teaches at the Georgia Institute of Technology.