Among Us: A Sandbox for Measuring and Detecting Agentic Deception
Golechha, Satvik, Garriga-Alonso, Adrià
Prior studies of deception in language-based AI agents typically assess whether the agent produces a false statement about a topic or makes a binary choice prompted by a goal, rather than allowing open-ended deceptive behavior to emerge in pursuit of a longer-term objective. To address this, we introduce $\textit{Among Us}$, a sandbox social deception game in which LLM agents exhibit long-term, open-ended deception as a consequence of the game's objectives. While most benchmarks saturate quickly, $\textit{Among Us}$ can be expected to last much longer, because it is a multi-player game far from equilibrium. Using the sandbox, we evaluate $18$ proprietary and open-weight LLMs and uncover a general trend: models trained with RL are comparatively much better at producing deception than at detecting it. We evaluate the effectiveness of two methods for detecting lying and deception: logistic regression on model activations and sparse autoencoders (SAEs). We find that probes trained on a dataset of ``pretend you're a dishonest model: $\dots$'' prompts generalize extremely well out-of-distribution, consistently obtaining AUROCs over 95% even when evaluated only on the deceptive statement, without the chain of thought. We also find two SAE features that detect deception well but cannot steer the model to lie less. We hope our open-sourced sandbox, game logs, and probes help anticipate and mitigate deceptive behavior and capabilities in language-based agents. (A minimal probe-training sketch follows this entry.)
- North America > United States > Texas (0.04)
- Europe > Spain (0.04)
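
The entry above trains linear probes on activations to detect deception. Below is a minimal sketch of that kind of probe, assuming mean-pooled single-layer activations stored in hypothetical `activations.npy` and `labels.npy` files; this illustrates the general technique, not the paper's exact pipeline:

```python
# Hypothetical sketch: train a linear probe to flag deceptive statements
# from a model's activations. File names, layer choice, and pooling are
# assumptions for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# X: (n_statements, d_model) mean-pooled activations from one layer;
# y: 1 if the statement came from a "pretend you're a dishonest model: ..."
# prompt, 0 otherwise.
X = np.load("activations.npy")
y = np.load("labels.npy")

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)

probe = LogisticRegression(max_iter=1000, C=1.0)
probe.fit(X_train, y_train)

# Score held-out statements; in the paper's spirit, the out-of-distribution
# test would use activations from Among Us game statements alone, without
# the chain of thought.
scores = probe.predict_proba(X_test)[:, 1]
print(f"AUROC: {roc_auc_score(y_test, scores):.3f}")
```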
Among Them: A game-based framework for assessing persuasion capabilities of LLMs
Idziejczak, Mateusz, Korzavatykh, Vasyl, Stawicki, Mateusz, Chmutov, Andrii, Korcz, Marcin, Błądek, Iwo, Brzezinski, Dariusz
The proliferation of large language models (LLMs) and autonomous AI agents has raised concerns about their potential for automated persuasion and social influence. While existing research has explored isolated instances of LLM-based manipulation, systematic evaluations of persuasion capabilities across different models remain limited. In this paper, we present an Among Us-inspired game framework for assessing LLM deception skills in a controlled environment. The proposed framework makes it possible to compare LLMs by game statistics, as well as to quantify in-game manipulation according to 25 persuasion strategies from social psychology and rhetoric. Experiments with 8 popular language models of different types and sizes demonstrate that all tested models exhibit persuasive capabilities, successfully employing 22 of the 25 anticipated techniques. We also find that larger models provide no persuasion advantage over smaller ones, and that longer model outputs are negatively correlated with the number of games won. Our study provides insights into the deception capabilities of LLMs, as well as tools and data for fostering future research on the topic. (A sketch of the two reported analyses follows this entry.)
- Research Report > Experimental Study (0.49)
- Research Report > New Finding (0.46)
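
A minimal sketch of the two analyses the entry above reports, coverage of the persuasion taxonomy and the length-versus-wins correlation; the game-log schema here is an assumption for illustration, not the framework's actual format:

```python
# Hypothetical log schema: per-message strategy annotations drawn from the
# 25-strategy taxonomy, plus which model spoke and whether it won the game.
from collections import Counter
from scipy.stats import spearmanr

games = [
    {"model": "model-a", "won": True,
     "messages": [{"len": 35, "strategies": ["appeal_to_authority"]}]},
    {"model": "model-b", "won": False,
     "messages": [{"len": 180, "strategies": ["ad_hominem", "bandwagon"]}]},
    {"model": "model-c", "won": True,
     "messages": [{"len": 60, "strategies": ["social_proof"]}]},
]

# Coverage of the persuasion taxonomy across all games.
used = Counter(s for g in games for m in g["messages"] for s in m["strategies"])
print(f"{len(used)} of 25 strategies observed:", used.most_common(3))

# Mean message length per game vs. win/loss; the paper reports a negative
# correlation over far more games than this toy sample.
lengths = [sum(m["len"] for m in g["messages"]) / len(g["messages"]) for g in games]
wins = [int(g["won"]) for g in games]
rho, p = spearmanr(lengths, wins)
print(f"Spearman rho = {rho:.2f} (p = {p:.2f})")
```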
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Sarkar, Bidipta, Xia, Warren, Liu, C. Karen, Sadigh, Dorsa
Communicating in natural language is a powerful tool in multi-agent settings, as it enables independent agents to share information in partially observable settings and allows zero-shot coordination with humans. However, most prior works are limited as they either rely on training with large amounts of human demonstrations or lack the ability to generate natural and useful communication strategies. In this work, we train language models to have productive discussions about their environment in natural language without any human demonstrations. We decompose the communication problem into listening and speaking. Our key idea is to leverage the agent's goal to predict useful information about the world as a dense reward signal that guides communication. Specifically, we improve a model's listening skills by training it to predict information about the environment based on discussions, and we simultaneously improve its speaking skills with multi-agent reinforcement learning by rewarding messages based on their influence on other agents. To investigate the role and necessity of communication in complex social settings, we study an embodied social deduction game based on Among Us, where the key question to answer is the identity of an adversarial impostor. We analyze emergent behaviors due to our technique, such as accusing suspects and providing evidence, and find that it enables strong discussions, doubling win rates compared to standard RL. We release our code and models at https://socialdeductionllm.github.io/ (A sketch of the reward decomposition follows this entry's tags.)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > United States > California > Santa Clara County > Stanford (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- (9 more...)
- Leisure & Entertainment > Games (0.46)
- Education (0.46)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.66)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
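
A minimal sketch of the listening/speaking decomposition the entry above describes, assuming each agent maintains belief logits over player slots; the interfaces and tensor shapes are illustrative assumptions, not the paper's implementation:

```python
# Hypothetical sketch of the reward decomposition: a dense supervised
# "listening" loss and an influence-based "speaking" reward.
import torch
import torch.nn.functional as F

def listening_loss(belief_logits: torch.Tensor,
                   true_impostor: torch.Tensor) -> torch.Tensor:
    """Dense supervised signal: each agent predicts the impostor's
    identity from the discussion so far (logits over player slots)."""
    return F.cross_entropy(belief_logits, true_impostor)

def speaking_reward(beliefs_before: torch.Tensor,
                    beliefs_after: torch.Tensor,
                    true_impostor: torch.Tensor) -> torch.Tensor:
    """Reward a message by its influence on the other agents: how much
    their probability on the true impostor rose after hearing it."""
    idx = true_impostor.unsqueeze(-1)
    p_before = beliefs_before.softmax(-1).gather(-1, idx).squeeze(-1)
    p_after = beliefs_after.softmax(-1).gather(-1, idx).squeeze(-1)
    return (p_after - p_before).mean()

# Toy usage: 4 listeners, 5 player slots, player 2 is the impostor.
before = torch.randn(4, 5)
after = before.clone()
after[:, 2] += 1.0  # the message pushed beliefs toward the impostor
target = torch.full((4,), 2)
print(listening_loss(after, target).item(),
      speaking_reward(before, after, target).item())
```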
AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game
Chi, Yizhou, Mao, Lingjun, Tang, Zineng
Strategic social deduction games serve as valuable testbeds for evaluating the understanding and inference skills of language models, offering crucial insights into social science, artificial intelligence, and strategic gaming. This paper focuses on creating proxies of human behavior in simulated environments, using Among Us as the testbed. The study introduces a text-based game environment, named AmongAgents, that mirrors the dynamics of Among Us. Players act as crew members aboard a spaceship, tasked with identifying impostors who are sabotaging the ship and eliminating the crew. Within this environment, we analyze the behavior of simulated language agents. The experiments involve diverse game sequences featuring different configurations of Crewmate and Impostor personality archetypes. Our work demonstrates that state-of-the-art large language models (LLMs) can effectively grasp the game rules and make decisions based on the current context. This work aims to promote further exploration of LLMs in goal-oriented games with incomplete information and complex action spaces, as these settings offer valuable opportunities to assess language model performance in socially driven scenarios. (A minimal agent-loop sketch follows this entry.)
- Personal > Interview (0.93)
- Research Report (0.82)
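
A minimal sketch of how an LLM agent can be wired into a text-based game loop of this kind; the action format and the `query_llm` stub are assumptions for illustration, not the AmongAgents API:

```python
# Hypothetical sketch: one turn of an LLM agent in a text-based social
# deduction game. A real system would call an actual model API here.
import re

def query_llm(prompt: str) -> str:
    """Stand-in for a real model call; returns a free-text reply."""
    return "ACTION: vote player_3"

def parse_action(reply: str, legal_actions: list[str]) -> str:
    """Pull an action out of free text, falling back to a safe default."""
    match = re.search(r"ACTION:\s*(.+)", reply)
    choice = match.group(1).strip() if match else ""
    return choice if choice in legal_actions else legal_actions[0]

def play_turn(observation: str, legal_actions: list[str], role: str) -> str:
    prompt = (
        f"You are a {role} on a spaceship in a social deduction game.\n"
        f"Observation:\n{observation}\n"
        f"Legal actions: {legal_actions}\n"
        "Reply with 'ACTION: <choice>'."
    )
    return parse_action(query_llm(prompt), legal_actions)

action = play_turn("Meeting called: a body was reported in Electrical.",
                   ["skip", "vote player_3"], role="crewmate")
print(action)
```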
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria
Kopparapu, Kavya, Duéñez-Guzmán, Edgar A., Matyas, Jayd, Vezhnevets, Alexander Sasha, Agapiou, John P., McKee, Kevin R., Everett, Richard, Marecki, Janusz, Leibo, Joel Z., Graepel, Thore
A key challenge in the study of multi-agent cooperation is that individual agents must not only cooperate effectively, but also decide with whom to cooperate. This is particularly critical when other agents have hidden, possibly misaligned motivations and goals. Social deduction games offer an avenue to study how individuals might learn to synthesize potentially unreliable information about others and elucidate their true motivations. In this work, we present Hidden Agenda, a two-team social deduction game that provides a 2D environment for studying learning agents in scenarios of unknown team alignment. The environment admits a rich set of strategies for both teams. Reinforcement learning agents trained in Hidden Agenda show that agents can learn a variety of behaviors, including partnering and voting, without the need for communication in natural language.
- South America > Brazil > São Paulo (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
If a robot is conscious, is it OK to turn it off? The moral implications of building true AIs
In the "Star Trek: The Next Generation" episode "The Measure of a Man," Data, an android crew member of the Enterprise, is to be dismantled for research purposes unless Captain Picard can argue that Data deserves the same rights as a human being. Naturally the question arises: What is the basis upon which something has rights? What gives an entity moral standing? The philosopher Peter Singer argues that creatures that can feel pain or suffer have a claim to moral standing. He argues that nonhuman animals have moral standing, since they can feel pain and suffer.
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Games > Chess (0.71)
- Information Technology > Artificial Intelligence > Robots (0.66)
The 10 Best Video Games of 2020
In a bizarre, unsettling, and oftentimes downright frightening year, video games became a port of refuge for many--be they longtime gamers, old-school veterans picking the controller back up after a break, or first-timers looking for a novel way to safely have fun or connect with friends during pandemic lockdowns. It's a small blessing, then, that it was also a banner year for excellent games to play. Here are TIME's best video games of 2020, according to our group of resident gamers, listed alphabetically. Also read TIME's lists of the 10 best fiction books of 2020 and the 100 must-read books of 2020. Nostalgia is big business right now, but reworking old joy rarely delivers that original thrill.