AITopics | miscommunication

miscommunication

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

Lupu, Andrei, Willi, Timon, Foerster, Jakob

arXiv.org Artificial Intelligence | Jun-26-2025

As Large Language Models (LLMs) gain agentic abilities, they will have to navigate complex multi-agent scenarios, interacting with human users and other agents in cooperative and competitive settings. This will require new reasoning skills, chief amongst them being theory of mind (ToM), or the ability to reason about the "mental" states of other agents. However, ToM and other multi-agent abilities in LLMs are poorly understood, since existing benchmarks suffer from narrow scope, data leakage, saturation, and lack of interactivity. We thus propose Decrypto, a game-based benchmark for multi-agent reasoning and ToM drawing inspiration from cognitive science, computational pragmatics and multi-agent reinforcement learning. It is designed to be as easy as possible in all other dimensions, eliminating confounding factors commonly found in other benchmarks. To our knowledge, it is also the first platform for designing interactive ToM experiments. We validate the benchmark design through comprehensive empirical evaluations of frontier LLMs, robustness studies, and human-AI cross-play experiments. We find that LLM game-playing abilities lag behind humans and simple word-embedding baselines. We then create variants of two classic cognitive science experiments within Decrypto to evaluate three key ToM abilities. Surprisingly, we find that state-of-the-art reasoning models are significantly worse at those tasks than their older counterparts. This demonstrates that Decrypto addresses a crucial gap in current reasoning and ToM evaluations, and paves the path towards better artificial agents.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.20664

Country: Europe (0.45)

Genre: Research Report > Experimental Study (0.67)

Industry:

Leisure & Entertainment > Games (0.67)
Media (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Why Robots Are Bad at Detecting Their Mistakes: Limitations of Miscommunication Detection in Human-Robot Dialogue

Janssens, Ruben, De Bock, Jens, Labat, Sofie, Verhelst, Eva, Hoste, Veronique, Belpaeme, Tony

arXiv.org Artificial Intelligence | Jun-26-2025

Detecting miscommunication in human-robot interaction is a critical function for maintaining user engagement and trust. While humans effortlessly detect communication errors in conversations through both verbal and non-verbal cues, robots face significant challenges in interpreting non-verbal feedback, despite advances in computer vision for recognizing affective expressions. This research evaluates the effectiveness of machine learning models in detecting miscommunications in robot dialogue. Using a multi-modal dataset of 240 human-robot conversations, where four distinct types of conversational failures were systematically introduced, we assess the performance of state-of-the-art computer vision models. After each conversational turn, users provided feedback on whether they perceived an error, enabling an analysis of the models' ability to accurately detect robot mistakes. Despite using state-of-the-art models, the performance barely exceeds random chance in identifying miscommunication, while on a dataset with more expressive emotional content, they successfully identified confused states. To explore the underlying cause, we asked human raters to do the same. They could also only identify around half of the induced miscommunications, similarly to our model. These results uncover a fundamental limitation in identifying robot miscommunications in dialogue: even when users perceive the induced miscommunication as such, they often do not communicate this to their robotic conversation partner. This knowledge can shape expectations of the performance of computer vision models and can help researchers to design better human-robot conversations by deliberately eliciting feedback where needed. In dialogue, individuals do more than merely interpret their interlocutors' words; they also seek feedback regarding the ongoing interaction.

artificial intelligence, expression, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2506.20268

Country: Europe > Belgium (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialogues

Steindl, Sebastian, Schäfer, Ulrich, Ludwig, Bernd

arXiv.org Artificial Intelligence | Dec-10-2024

Large-scale Wizard-Of-Oz dialogue datasets have enabled the training of deep learning-based dialogue systems. While they are successful as benchmark datasets, they lack certain types of utterances, which would make them more realistic. In this work, we investigate the creation of synthetic communication errors in an automatic pipeline. Based on linguistic theory, we propose and follow a simple error taxonomy. We focus on three types of miscommunications that could happen in real-world dialogues but are underrepresented in the benchmark dataset: misunderstandings, non-understandings and vaguely related questions. Our two-step approach uses a state-of-the-art Large Language Model (LLM) to first create the error and secondly the repairing utterance. We perform Language Model-based evaluation to ensure the quality of the generated utterances. We apply the method to the MultiWOZ dataset and evaluate it both qualitatively and empirically as well as with human judges. Our results indicate that current LLMs can aid in adding post-hoc miscommunications to benchmark datasets as a form of data augmentation. We publish the resulting dataset, in which nearly 1900 dialogues have been modified, as CoPrUS-MultiWOZ to facilitate future work on dialogue systems.

large language model, machine learning, utterance, (21 more...)

arXiv.org Artificial Intelligence

2412.07515

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
(9 more...)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.94)

No that's not what I meant: Handling Third Position Repair in Conversational Question Answering

Balaraman, Vevake, Eshghi, Arash, Konstas, Ioannis, Papaioannou, Ioannis

arXiv.org Artificial Intelligence | Jul-31-2023

The ability to handle miscommunication is crucial to robust and faithful conversational AI. People usually deal with miscommunication immediately as they detect it, using highly systematic interactional mechanisms called repair. One important type of repair is Third Position Repair (TPR) whereby a speaker is initially misunderstood but then corrects the misunderstanding as it becomes apparent after the addressee's erroneous response. Here, we collect and publicly release Repair-QA, the first large dataset of TPRs in a conversational question answering (QA) setting. The data is comprised of the TPR turns, corresponding dialogue contexts, and candidate repairs of the original turn for execution of TPRs. We demonstrate the usefulness of the data by training and evaluating strong baseline models for executing TPRs. For stand-alone TPR execution, we perform both automatic and human evaluations on a fine-tuned T5 model, as well as OpenAI's GPT-3 LLMs. Additionally, we extrinsically evaluate the LLMs' TPR processing capabilities in the downstream conversational QA task. The results indicate poor out-of-the-box performance on TPRs by the GPT-3 models, which then significantly improves when exposed to Repair-QA.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2307.16689

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > India (0.05)
(12 more...)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Why Implementing RPA in your Revenue Cycle is Crucial?

#artificialintelligence | Mar-25-2023, 19:33:08 GMT

Are you tired of spending countless hours on mundane, repetitive tasks that drain your energy and hinder your productivity? Do you wish there was a way to streamline your operations and reduce errors while freeing up your time to focus on growing your business? Look no further than Robotic Process Automation (RPA)! RPA is a technology that uses software robots to automate tedious and time-consuming tasks, freeing up valuable resources and improving overall efficiency. As a Healthcare Revenue Cycle Business Owner, you can benefit from RPA in several ways.

implementing rpa, patient experience, rpa, (11 more...)

#artificialintelligence

Country:

North America > United States (0.06)
Asia > Middle East > UAE (0.06)
Asia > India (0.06)

Industry: Health & Medicine > Health Care Providers & Services (0.53)

Technology: Information Technology > Artificial Intelligence > Robots (0.95)

Trump Crony Proves Widespread Voter Fraud Doesn't Exist

Slate | May-30-2018, 22:40:07 GMT

Did voter fraud swing New Hampshire away from Donald Trump in the 2016 election? Absolutely not, according to an exhaustive investigation conducted by the state's attorney general and secretary of state, which, counter to Trump's persistent allegations, turned up no evidence of "serious voter fraud." Instead, the inquiry provided further evidence that the tools Republicans use to detect voter fraud are fatally flawed, churning out a huge number of false positives. And while the New Hampshire investigation ultimately debunked Trump's paranoia, it came perilously close to disenfranchising thousands of lawful voters. Republicans have seized upon New Hampshire as the putative epicenter of American voter fraud for two reasons.

artificial intelligence, machine learning, voter, (16 more...)

Slate

Country:

North America > United States > New Hampshire (1.00)
North America > United States > Massachusetts (0.07)
North America > United States > Vermont (0.05)
North America > United States > Maine (0.05)

Industry:

Law > Government & the Courts (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.52)

How Not To Lie With Statistics

@machinelearnbot | Jan-11-2018, 19:22:48 GMT

"What is truth?" and "What is a lie?" are questions that have drawn the attention of philosophers, theologians, legal scholars and intellectuals of many kinds for centuries. I am not a scholar or intellectual, merely a hardhat statistician working in marketing research and what is vaguely called data science. Regardless of what we do for a living, however, all of us are consumers of statistics at work and in our daily lives. "Statistics" can refer to figures or mathematical models, and either can be used to deceive us, is often misinterpreted, or can be flat-out wrong. Deception in various forms can be found in nature, and pet owners may have noticed that it is not exclusively a human trait.

artificial intelligence, social media, statistician, (15 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.50)

Emoji meanings vary hugely between platforms, meaning characters can lead to vast miscommunication, study finds

The Independent - Tech | Apr-12-2016, 10:10:04 GMT

artificial intelligence, platform, social media, (16 more...)

The Independent - Tech

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Minnesota (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Europe > United Kingdom > England > Kent (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Government (0.89)
Transportation (0.72)
Leisure & Entertainment > Games > Computer Games (0.70)

Technology:

Information Technology > Artificial Intelligence (0.96)
Information Technology > Communications > Social Media (0.48)
Information Technology > Communications > Mobile (0.31)

Towards Overcoming Miscommunication in Situated Dialogue by Asking Questions

Marge, Matthew (Carnegie Mellon University) | Rudnicky, Alexander I. (Carnegie Mellon University)

AAAI Conferences | Nov-1-2011

Situated dialogue is prominent in the robot navigation task, where a human gives route instructions (i.e., a sequence of navigation commands) to an agent. We propose an approach for situated dialogue agents whereby they use strategies such as asking questions to repair or recover from unclear instructions, namely those that an agent misunderstands or considers ambiguous. Most immediately in this work we study examples from existing human-human dialogue corpora and relate them to our proposed approach.

artificial intelligence, instruction, natural language, (17 more...)

AAAI Conferences

2011 AAAI Fall Symposium Series

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Technology:

Information Technology > Artificial Intelligence > Speech (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.49)

Detecting, Repairing, and Preventing Human-Machine Miscommunication

McRoy, Susan

AI Magazine | Mar-15-1997

The next portion of the workshop was devoted to different approaches to preventing and repairing miscommunication. These sessions represent a progression from work that clarifies the problem of miscommunication to work that describes the strategies used to repair miscommunication. I review the most significant issues … between different parts of their discourse model or between the discourse model and the domain model. The last session was the presentation of work involving deployed systems using speech as a mode of interaction. The approaches differed in two dimensions: First, experimenters … Research related to achieving robust interaction is an important … Early work concerned the correction of spelling or grammatical errors in a user's utterance so … were constrained by their impact on overall system performance … have assumed that the system's model is always correct.

artificial intelligence, miscommunication, natural language, (15 more...)

AI Magazine

Country:

North America > United States > Wisconsin (0.17)
North America > United States > Oregon (0.16)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.51)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.36)
Information Technology > Artificial Intelligence > Speech (0.33)
Information Technology > Artificial Intelligence > Natural Language (0.31)
