Collaborating Authors

 Wersing, Heiko


Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch

arXiv.org Artificial Intelligence

The rapid development of artificial intelligence and robotics has had a significant impact on our lives, with intelligent systems increasingly taking over tasks traditionally performed by humans. Efficient knowledge transfer requires matching the mental model of the human teacher with the capabilities of the robot learner. This paper introduces the Mental Model Mismatch (MMM) Score, a feedback mechanism designed to quantify and reduce mismatches by aligning human teaching behavior with robot learning behavior. Using Large Language Models (LLMs), we analyze teacher intentions expressed in natural language to generate adaptive feedback. A study with 150 participants teaching a virtual robot to solve a puzzle game shows that intention-based feedback significantly outperforms traditional performance-based feedback or no feedback. The results suggest that intention-based feedback improves instructional outcomes, deepens understanding of the robot's learning process, and reduces misconceptions. This research addresses a critical gap in human-robot interaction (HRI) by providing a method to quantify and mitigate discrepancies between human mental models and robot capabilities, with the goal of improving both robot learning and human teaching effectiveness.
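
A minimal sketch of the core idea described above: an LLM compares what the teacher intends with what the robot can actually learn and returns a mismatch score that drives adaptive feedback. The prompt wording, the 0-1 score range, and the `query_llm` placeholder are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch: scoring the mismatch between a teacher's stated intention
# and a robot learner's capabilities with an LLM (names and prompt are assumptions).

def query_llm(prompt: str) -> str:
    """Placeholder for any chat-style LLM call (plug in your own client here)."""
    raise NotImplementedError

def mmm_score(teacher_utterance: str, robot_capabilities: list[str]) -> float:
    """Ask the LLM how well the teacher's intention matches what the robot can learn."""
    prompt = (
        "A human teaches a robot by giving instructions.\n"
        f"Teacher instruction: {teacher_utterance!r}\n"
        f"Robot capabilities: {', '.join(robot_capabilities)}\n"
        "On a scale from 0 (intention fully matches the robot's capabilities) to 1 "
        "(complete mismatch), rate the mental model mismatch. Answer with a number only."
    )
    return float(query_llm(prompt).strip())

def adaptive_feedback(score: float) -> str:
    """Turn the mismatch score into intention-based feedback for the human teacher."""
    if score < 0.3:
        return "Your teaching matches how the robot learns - keep going."
    if score < 0.7:
        return "The robot learns from simple, stepwise demonstrations; try splitting your instruction."
    return "The robot cannot follow abstract goals; demonstrate one concrete move at a time."
```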


The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems

arXiv.org Artificial Intelligence

We examine how users perceive the limitations of an AI system when it encounters a task it cannot perform perfectly, and whether providing explanations alongside its answers helps users construct an appropriate mental model of the system's capabilities and limitations. We employ a visual question answering and explanation task in which we control the AI system's limitations by manipulating the visual inputs: during inference, the system processes either full-color or grayscale images. Our goal is to determine whether participants can perceive the limitations of the system. We hypothesize that explanations will make limited AI capabilities more transparent to users. However, our results show that explanations do not have this effect. Instead of allowing users to assess the limitations of the AI system more accurately, explanations generally increase users' perception of the system's competence - regardless of its actual performance.
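
A small sketch of the capability manipulation described above: the same VQA model is queried either with the original image or with a grayscale version, which removes the colour information it would need for colour-related questions. The function name and the use of PIL are assumptions for illustration.

```python
from PIL import Image

def prepare_vqa_input(image_path: str, limited: bool) -> Image.Image:
    """Return the image the VQA system will see; limited=True removes colour information."""
    img = Image.open(image_path).convert("RGB")
    if limited:
        # Convert to grayscale and back to RGB so the model's input shape is unchanged.
        img = img.convert("L").convert("RGB")
    return img
```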


To Help or Not to Help: LLM-based Attentive Support for Human-Robot Group Interactions

arXiv.org Artificial Intelligence

How can a robot provide unobtrusive physical support within a group of humans? We present Attentive Support, a novel interaction concept for robots to support a group of humans. It combines scene perception, dialogue acquisition, situation understanding, and behavior generation with the common-sense reasoning capabilities of Large Language Models (LLMs). In addition to following user instructions, Attentive Support is capable of deciding when and how to support the humans, and when to remain silent so as not to disturb the group. Across a diverse set of scenarios, we demonstrate and evaluate the robot's attentive behavior, which supports and helps the humans when required while not disturbing them if no help is needed.
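
A hedged sketch of the "decide when and how to support" step: the scene and the group's recent dialogue are summarised into a prompt and an LLM picks one of a small set of actions, defaulting to silence. The action set, prompt wording, and `query_llm` placeholder are assumptions, not the paper's exact pipeline.

```python
# Illustrative decision step for attentive, non-disruptive support.
ACTIONS = ("remain_silent", "speak", "physically_assist")

def query_llm(prompt: str) -> str:
    """Placeholder for any chat-style LLM call."""
    raise NotImplementedError

def decide_support(scene_description: str, dialogue_history: list[str]) -> str:
    prompt = (
        "You are a robot that supports a group of humans without disturbing them.\n"
        f"Scene: {scene_description}\n"
        "Recent dialogue:\n" + "\n".join(dialogue_history) + "\n"
        f"Choose exactly one action from {ACTIONS}. "
        "Answer 'remain_silent' unless help is clearly needed."
    )
    answer = query_llm(prompt).strip()
    # Fail safe: if the answer is not a known action, do not disturb the group.
    return answer if answer in ACTIONS else "remain_silent"
```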


What you need to know about a learning robot: Identifying the enabling architecture of complex systems

arXiv.org Artificial Intelligence

Nowadays, we deal more and more with robots and AI in everyday life. However, their behavior is not always apparent to most lay users, especially in error situations. As a result, misconceptions about the behavior of the technologies in use can arise, which in turn can lead to misuse and rejection by users. Explanation, for example through transparency, can address these misconceptions. However, it would be confusing and overwhelming for users if the entire software or hardware were explained. Therefore, this paper looks at the 'enabling' architecture: those aspects of a robotic system that might need to be explained to enable someone to use the technology effectively. Furthermore, this paper is concerned with the 'explanandum', the corresponding misunderstandings or missing concepts about the enabling architecture that need to be clarified. We have thus developed, and present here, an approach for determining this 'enabling' architecture and the resulting 'explanandum' of complex technologies.


Locally Adaptive Nearest Neighbors

arXiv.org Machine Learning

When training automated systems, it has been shown to be beneficial to adapt the representation of data by learning a problem-specific metric. We extend this idea and, for the widely used family of k-nearest-neighbors algorithms, develop a method that allows learning locally adaptive metrics. To demonstrate important aspects of how our approach works, we conduct a number of experiments on synthetic data sets, and we show its usefulness on real-world benchmark data sets. Machine learning models increasingly pervade our daily lives in the form of recommendation systems, computer vision, driver assistance, etc., challenging us to realize seamless cooperation between human and algorithmic agents. One desirable property of predictions made by machine learning models is their transparency, expressed, for instance, as a statement about which factors of a given setting have the greatest influence on the decision at hand - in particular, this requirement aligns with the EU General Data Protection Regulation, which includes a "right to explanation" [1].
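
A toy sketch of the general idea behind a locally adaptive metric for k-NN: instead of one global distance, each training point carries its own diagonal metric. Here the per-point weights are simply inverse local feature variances - a stand-in heuristic for illustration, not the learning rule proposed in the paper.

```python
import numpy as np

def fit_local_weights(X: np.ndarray, k: int = 10) -> np.ndarray:
    """For each training point, weight features by inverse variance within its k-neighbourhood."""
    X = np.asarray(X, dtype=float)
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)      # pairwise squared distances
    weights = np.empty(X.shape, dtype=float)
    for i in range(len(X)):
        neigh = X[np.argsort(d2[i])[:k]]                      # local neighbourhood of point i
        weights[i] = 1.0 / (neigh.var(axis=0) + 1e-8)         # adaptive diagonal metric
    return weights / weights.sum(axis=1, keepdims=True)       # normalise per point

def predict(x: np.ndarray, X: np.ndarray, y: np.ndarray, W: np.ndarray, k: int = 5) -> int:
    """Classify x with k-NN, measuring distance with each training point's own metric.

    y must contain non-negative integer class labels.
    """
    d2 = (W * (X - x) ** 2).sum(axis=1)                       # locally weighted distances
    votes = y[np.argsort(d2)[:k]]
    return int(np.bincount(votes).argmax())
```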


Adversarial attacks hidden in plain sight

arXiv.org Machine Learning

The use of convolutional neural networks has led to tremendous achievements since Krizhevsky et al. [1] presented AlexNet in 2012. Despite efforts to understand the inner workings of such neural networks, they mostly remain black boxes that are hard to interpret or explain. The issue was exacerbated in 2013 when Szegedy et al. [2] showed that "adversarial examples" - images perturbed in such a way that they fool a neural network - prove that neural networks do not simply work correctly the way one might naïvely expect. Typically, such adversarial attacks change an input only slightly, but in an adversarial manner, such that humans would not regard the difference between the inputs as relevant, but machines do. There are various types of attacks, such as one-pixel attacks, attacks that work in the physical world, and attacks that produce inputs fooling several different neural networks without explicit knowledge of those networks [3, 4, 5].
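
A minimal sketch of the kind of small, adversarial perturbation described above, using the fast gradient sign method (FGSM) as a representative attack rather than one of the specific attacks cited in the paper. The `model` and the assumption that image pixels lie in [0, 1] are illustrative.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model: torch.nn.Module, image: torch.Tensor, label: torch.Tensor,
                epsilon: float = 0.01) -> torch.Tensor:
    """Perturb a batched image tensor by a small step that increases the classification loss."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    # A tiny step along the sign of the gradient is barely visible to humans,
    # yet can flip the model's prediction.
    adversarial = image + epsilon * image.grad.sign()
    return adversarial.clamp(0.0, 1.0).detach()
```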


Learning Lateral Interactions for Feature Binding and Sensory Segmentation

Neural Information Processing Systems

We present a new approach to the supervised learning of lateral interactions for the competitive layer model (CLM) dynamic feature binding architecture. The method is based on consistency conditions, which were recently shown to characterize the attractor states of this linear threshold recurrent network. For a given set of training examples the learning problem is formulated as a convex quadratic optimization problem in the lateral interaction weights. An efficient dimension reduction of the learning problem can be achieved by using a linear superposition of basis interactions. We show the successful application of the method to a medical image segmentation problem of fluorescence microscope cell images.
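
A schematic of the dimension-reduced learning problem described above: the lateral interaction is written as a linear superposition f = sum_m c_m g_m of fixed basis interactions, and the coefficients c are found with a convex quadratic programme whose constraints enforce that, for a training labelling, the target layer of each feature wins by a margin. The exact consistency conditions, margin, and regulariser of the paper are abbreviated here; variable names and the soft-margin slack are assumptions.

```python
import cvxpy as cp
import numpy as np

def learn_interaction_coefficients(G: np.ndarray, assignment: np.ndarray,
                                   n_layers: int, margin: float = 1.0,
                                   slack_penalty: float = 10.0) -> np.ndarray:
    """G: (M, N, N) stack of basis interactions g_m(r, r'); assignment[r]: target layer of feature r."""
    M, N, _ = G.shape
    c = cp.Variable(M)                      # coefficients of the superposition f = sum_m c_m g_m
    xi = cp.Variable(N, nonneg=True)        # slack for features whose conditions cannot be met exactly
    x = np.eye(n_layers)[assignment]        # (N, L) one-hot target labelling of the training example
    # support[r, a] = sum_r' f(r, r') x_{r'a}, linear in the coefficients c
    support = sum(c[m] * (G[m] @ x) for m in range(M))
    constraints = []
    for r in range(N):
        a = int(assignment[r])
        for b in range(n_layers):
            if b != a:
                # simplified consistency condition: the target layer must win by a margin (up to slack)
                constraints.append(support[r, a] >= support[r, b] + margin - xi[r])
    problem = cp.Problem(cp.Minimize(cp.sum_squares(c) + slack_penalty * cp.sum(xi)),
                         constraints)
    problem.solve()
    return c.value
```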

