AITopics | Gienger, Michael

Collaborating Authors

Gienger, Michael

SemanticScanpath: Combining Gaze and Speech for Situated Human-Robot Interaction Using LLMs

Menendez, Elisabeth, Gienger, Michael, Martínez, Santiago, Balaguer, Carlos, Belardinelli, Anna

arXiv.org Artificial Intelligence, Mar-19-2025

Large Language Models (LLMs) have substantially improved the conversational capabilities of social robots. Nevertheless, for an intuitive and fluent human-robot interaction, robots should be able to ground the conversation by relating ambiguous or underspecified spoken utterances to the current physical situation and to the intents expressed nonverbally by the user, for example by using referential gaze. Here we propose a representation integrating speech and gaze to enable LLMs to obtain higher situated awareness and correctly resolve ambiguous requests. Our approach relies on a text-based semantic translation of the scanpath produced by the user along with the verbal requests and demonstrates LLMs' capabilities to reason about gaze behavior, robustly ignoring spurious glances or irrelevant objects. We validate the system across multiple tasks and two scenarios, showing its generality and accuracy, and demonstrate its implementation on a robotic platform, closing the loop from request interpretation to execution.
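
To make the idea of a text-based scanpath translation concrete, here is a minimal Python sketch. It is not the authors' representation: the `Fixation` record, the `min_duration` threshold, and the output wording are all assumptions chosen for illustration. It merges consecutive fixations on the same object, drops very short glances, and emits a sentence that could accompany the spoken request in an LLM prompt.

```python
from dataclasses import dataclass


@dataclass
class Fixation:
    target: str   # label of the fixated object (hypothetical scene labels)
    start: float  # fixation onset in seconds
    end: float    # fixation offset in seconds


def scanpath_to_text(fixations, min_duration=0.15):
    """Turn a list of fixations into a short textual gaze description."""
    merged = []
    for fix in fixations:
        # Merge back-to-back fixations that land on the same object.
        if merged and merged[-1].target == fix.target:
            merged[-1] = Fixation(fix.target, merged[-1].start, fix.end)
        else:
            merged.append(Fixation(fix.target, fix.start, fix.end))

    # Drop glances shorter than the (assumed) duration threshold.
    kept = [f for f in merged if f.end - f.start >= min_duration]
    if not kept:
        return "The user did not fixate any object."

    parts = [f"{f.target} ({f.end - f.start:.1f} s)" for f in kept]
    return "While speaking, the user looked at: " + ", then ".join(parts) + "."


if __name__ == "__main__":
    gaze = [
        Fixation("cup", 0.0, 0.4),
        Fixation("cup", 0.4, 0.9),
        Fixation("wall", 0.9, 1.0),    # short glance, filtered out
        Fixation("bottle", 1.0, 1.6),
    ]
    print('User said: "Can you hand me that?"')
    print(scanpath_to_text(gaze))
```

In this sketch the filtering of short glances happens before the text reaches the model; the abstract instead credits the LLM itself with ignoring spurious glances, so a real system might pass the unfiltered sequence.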

artificial intelligence, large language model, natural language, (18 more...)

arXiv:2503.16548

Country:

Europe (0.28)
North America > United States (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling

Razmjoo, Amirreza, Calinon, Sylvain, Gienger, Michael, Zhang, Fan

arXiv.org Artificial Intelligence, Mar-19-2025

Imitation Learning offers a promising approach to learn directly from data without requiring explicit models, simulations, or detailed task definitions. During inference, actions are sampled from the learned distribution and executed on the robot. However, sampled actions may fail for various reasons, and simply repeating the sampling step until a successful action is obtained can be inefficient. In this work, we propose an enhanced sampling strategy that refines the sampling distribution to avoid previously unsuccessful actions. We demonstrate that by solely utilizing data from successful demonstrations, our method can infer recovery actions without the need for additional exploratory behavior or a high-level controller. Furthermore, we leverage the concept of diffusion model decomposition to break down the primary problem, which may require long-horizon history to manage failures, into multiple smaller, more manageable sub-problems in learning, data collection, and inference, thereby enabling the system to adapt to variable failure counts. Our approach yields a low-level controller that dynamically adjusts its sampling space to improve efficiency when prior samples fall short. We validate our method across several tasks, including door opening with unknown directions, object manipulation, and button-searching scenarios, demonstrating that our approach outperforms traditional baselines. Supplementary materials for this paper are available on our website: https://hri-eu.github.io/ccdp/.

I. INTRODUCTION

Recent advances in imitation learning, exemplified by methods such as Implicit Behavior Cloning [1] and diffusion/flow-matching policies [2], [3], have demonstrated remarkable capabilities in learning and replicating complex data distributions from demonstrations.

artificial intelligence, demonstration, machine learning, (15 more...)

arXiv:2503.15386

Country:

Europe > Switzerland (0.14)
Europe > Germany (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Efficient Symbolic Planning with Views

Hasler, Stephan, Tanneberg, Daniel, Gienger, Michael

arXiv.org Artificial Intelligence, May-6-2024

Robotic planning systems model spatial relations in detail as these are needed for manipulation tasks. In contrast, other physical attributes of objects and the effect of devices are usually oversimplified and expressed by abstract compound attributes. This limits the ability of planners to find alternative solutions. We propose to break these compound attributes down into a shared set of elementary attributes. This strongly facilitates generalization between different tasks and environments and thus helps to find innovative solutions. On the downside, this generalization comes with an increased complexity of the solution space. Therefore, as the main contribution of the paper, we propose a method that splits the planning problem into a sequence of views, where in each view only an increasing subset of attributes is considered. We show that this view-based strategy offers a good compromise between planning speed and quality of the found plan, and discuss its general applicability and limitations.

artificial intelligence, microwave, planning & scheduling, (18 more...)

arXiv:2405.03307

Country: Europe (0.14)

Genre: Research Report (0.84)

Industry: Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.51)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Learning Deep Dynamical Systems using Stable Neural ODEs

Sochopoulos, Andreas, Gienger, Michael, Vijayakumar, Sethu

arXiv.org Artificial Intelligence, Apr-16-2024

Learning complex trajectories from demonstrations in robotic tasks has been effectively addressed through the utilization of Dynamical Systems (DS). State-of-the-art DS learning methods ensure stability of the generated trajectories; however, they have three shortcomings: a) the DS is assumed to have a single attractor, which limits the diversity of tasks it can achieve, b) state derivative information is assumed to be available in the learning process and c) the state of the DS is assumed to be measurable at inference time. We propose a class of provably stable latent DS with possibly multiple attractors that inherit the training methods of Neural Ordinary Differential Equations, thus dropping the dependency on state derivative information. A diffeomorphic mapping for the output and a loss that captures time-invariant trajectory similarity are proposed. We validate the efficacy of our approach through experiments conducted on a public dataset of handwritten shapes and within a simulated object manipulation task.

artificial intelligence, machine learning, trajectory, (16 more...)

arXiv:2404.10622

Country: Europe (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Generating consistent PDDL domains with Large Language Models

Smirnov, Pavel, Joublin, Frank, Ceravola, Antonello, Gienger, Michael

arXiv.org Artificial Intelligence, Apr-11-2024

Large Language Models (LLMs) are capable of transforming natural language domain descriptions into plausible-looking PDDL markup. However, ensuring that actions are consistent within domains remains a challenging task. In this paper we present a novel concept to significantly improve the quality of LLM-generated PDDL models by performing automated consistency checking during the generation process. Although the proposed consistency checking strategies still cannot guarantee absolute correctness of generated models, they can serve as a valuable source of feedback, reducing the amount of correction effort expected from a human in the loop. We demonstrate the capabilities of our error detection approach on a number of classical and custom planning domains (logistics, gripper, tyreworld, household, pizza).
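
The abstract does not spell out which consistency checks are performed. As one simple illustration of what an automated check on a generated domain could look like, the Python sketch below flags predicates that an action uses but the domain never declares. The dictionary layout, the function name `find_undeclared_predicates`, and the toy gripper-like domain are assumptions made for this example, not the paper's data model.

```python
def find_undeclared_predicates(domain):
    """Return {action_name: [predicates used but not declared]}.

    `domain` is a plain dict (hypothetical layout):
      {"predicates": [...],
       "actions": {name: {"precondition": [...], "effect": [...]}}}
    """
    declared = set(domain["predicates"])
    report = {}
    for action_name, action in domain["actions"].items():
        used = set(action["precondition"]) | set(action["effect"])
        missing = sorted(used - declared)
        if missing:
            report[action_name] = missing
    return report


if __name__ == "__main__":
    # Toy domain loosely modeled on the classical gripper domain.
    toy_domain = {
        "predicates": ["at-robby", "at", "free", "carry"],
        "actions": {
            "pick": {
                "precondition": ["at", "at-robby", "free"],
                "effect": ["carry", "at", "free"],
            },
            "drop": {
                "precondition": ["carry", "at-robby"],
                # "holding" was never declared: an inconsistency to report.
                "effect": ["at", "free", "holding"],
            },
        },
    }
    print(find_undeclared_predicates(toy_domain))
```

A report like this could be fed back to the LLM as a correction prompt or shown to the human in the loop; either use is an assumption about how such feedback might be consumed.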

large language model, natural language, predicate, (17 more...)

arXiv:2404.07751

Country: Europe (0.14)

Genre:

Research Report (1.00)
Workflow (0.74)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Impact-Aware Bimanual Catching of Large-Momentum Objects

Yan, Lei, Stouraitis, Theodoros, Moura, João, Xu, Wenfu, Gienger, Michael, Vijayakumar, Sethu

arXiv.org Artificial Intelligence, Mar-25-2024

This paper investigates one of the most challenging tasks in dynamic manipulation: catching large-momentum moving objects. Beyond the realm of quasi-static manipulation, dealing with highly dynamic objects can significantly improve the robot's capability of interacting with its surrounding environment. Yet, the inevitable motion mismatch between the fast-moving object and the approaching robot will result in large impulsive forces, which lead to unstable contacts and irreversible damage to both the object and the robot. To address the above problems, we propose an online optimization framework to: 1) estimate and predict the linear and angular motion of the object; 2) search and select the optimal contact locations across every surface of the object to mitigate impact through sequential quadratic programming (SQP); 3) simultaneously optimize the end-effector motion, stiffness, and contact force for both robots using multi-mode trajectory optimization (MMTO); and 4) realise the impact-aware catching motion on the compliant robotic system based on an indirect force controller. We validate the impulse distribution, contact selection, and impact-aware MMTO algorithms in simulation and demonstrate the benefits of the proposed framework in real-world experiments including catching large-momentum moving objects with well-defined motion, constrained motion and free-flying motion.

artificial intelligence, optimization problem, robot, (17 more...)

arXiv:2403.17249

Country:

Asia > China (0.68)
Europe (0.68)
North America > United States > California (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

To Help or Not to Help: LLM-based Attentive Support for Human-Robot Group Interactions

Tanneberg, Daniel, Ocker, Felix, Hasler, Stephan, Deigmoeller, Joerg, Belardinelli, Anna, Wang, Chao, Wersing, Heiko, Sendhoff, Bernhard, Gienger, Michael

arXiv.org Artificial Intelligence, Mar-19-2024

How can a robot provide unobtrusive physical support within a group of humans? We present Attentive Support, a novel interaction concept for robots to support a group of humans. It combines scene perception, dialogue acquisition, situation understanding, and behavior generation with the common-sense reasoning capabilities of Large Language Models (LLMs). In addition to following user instructions, Attentive Support is capable of deciding when and how to support the humans, and when to remain silent to not disturb the group. With a diverse set of scenarios, we show and evaluate the robot's attentive behavior, which supports and helps the humans when required, while not disturbing if no help is needed.

artificial intelligence, large language model, natural language, (17 more...)

arXiv:2403.12533

Country: Europe (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Large Language Models for Multi-Modal Human-Robot Interaction

Wang, Chao, Hasler, Stephan, Tanneberg, Daniel, Ocker, Felix, Joublin, Frank, Ceravola, Antonello, Deigmoeller, Joerg, Gienger, Michael

arXiv.org Artificial Intelligence, Jan-26-2024

This paper presents an innovative large language model (LLM)-based robotic system for enhancing multi-modal human-robot interaction (HRI). Traditional HRI systems relied on complex designs for intent estimation, reasoning, and behavior generation, which were resource-intensive. In contrast, our system empowers researchers and practitioners to regulate robot behavior through three key aspects: providing high-level linguistic guidance, creating "atomics" for actions and expressions the robot can use, and offering a set of examples. Implemented on a physical robot, it demonstrates proficiency in adapting to multi-modal inputs and determining the appropriate manner of action to assist humans with its arms, following researchers' defined guidelines. Simultaneously, it coordinates the robot's lid, neck, and ear movements with speech output to produce dynamic, multi-modal expressions. This showcases the system's potential to revolutionize HRI by shifting from conventional, manual state-and-flow design methods to an intuitive, guidance-based, and example-driven approach.

large language model, machine learning, natural language, (17 more...)

arXiv:2401.15174

Country:

Europe (0.29)
North America > United States (0.14)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Learning from Few Demonstrations with Frame-Weighted Motion Generation

Sun, Jianyong, Kober, Jens, Gienger, Michael, Zhu, Jihong

arXiv.org Artificial Intelligence, Oct-26-2023

Learning from Demonstration (LfD) enables robots to acquire versatile skills by learning motion policies from human demonstrations. It endows users with an intuitive interface to transfer new skills to robots without the need for time-consuming robot programming and inefficient solution exploration. During task executions, the robot motion is usually influenced by constraints imposed by environments. In light of this, task-parameterized LfD (TP-LfD) encodes relevant contextual information into reference frames, enabling better skill generalization to new situations. However, most TP-LfD algorithms typically require multiple demonstrations across various environmental conditions to ensure sufficient statistics for a meaningful model. It is not a trivial task for robot users to create different situations and perform demonstrations under all of them. Therefore, this paper presents a novel algorithm to learn skills from few demonstrations. By leveraging the reference frame weights that capture the frame importance or relevance during task executions, our method demonstrates excellent skill acquisition performance, which is validated in real robotic environments.

artificial intelligence, demonstration, reference frame, (13 more...)

arXiv:2303.14188

Country:

Europe > Netherlands (0.14)
Europe > United Kingdom (0.14)

Genre: Research Report (0.40)

Industry: Education (0.48)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Robotic Fabric Flattening with Wrinkle Direction Detection

Qiu, Yulei, Zhu, Jihong, Della Santina, Cosimo, Gienger, Michael, Kober, Jens

arXiv.org Artificial Intelligence, Oct-26-2023

Deformable Object Manipulation (DOM) is an important field of research as it contributes to practical tasks such as automatic cloth handling, cable routing, surgical operation, etc. Perception is considered one of the major challenges in DOM due to the complex dynamics and high degree of freedom of deformable objects. In this paper, we develop a novel image-processing algorithm based on Gabor filters to extract useful features from cloth, and based on this, devise a strategy for cloth flattening tasks. We also evaluate the overall framework experimentally and compare it with three human operators. The results show that our algorithm can determine the direction of wrinkles on the cloth accurately in simulation as well as in real robot experiments. Furthermore, our dewrinkling strategy compares favorably to baseline methods.
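
As a rough illustration of how Gabor filters can expose a dominant direction in an image, the Python sketch below builds a small bank of Gabor kernels, filters a synthetic striped image, and picks the orientation with the highest response energy. This is a generic textbook construction, not the paper's algorithm: the kernel parameters, the four-orientation bank, the FFT-based circular convolution, and the synthetic "wrinkle" image are all assumptions. The returned angle is the direction along which intensity varies; the ridges themselves run perpendicular to it.

```python
import numpy as np


def gabor_kernel(theta, wavelength=8.0, sigma=4.0, gamma=0.5, size=31):
    """Real (cosine) Gabor kernel tuned to intensity variation along angle theta."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    xr = x * np.cos(theta) + y * np.sin(theta)    # coordinate along theta
    yr = -x * np.sin(theta) + y * np.cos(theta)   # coordinate across theta
    envelope = np.exp(-(xr**2 + (gamma * yr)**2) / (2.0 * sigma**2))
    kernel = envelope * np.cos(2.0 * np.pi * xr / wavelength)
    return kernel - kernel.mean()                 # remove the DC response


def dominant_orientation(image, n_orientations=4):
    """Return (best_theta, energies) over evenly spaced orientations in [0, pi)."""
    thetas = np.arange(n_orientations) * np.pi / n_orientations
    spectrum = np.fft.fft2(image)
    energies = []
    for theta in thetas:
        kernel_spectrum = np.fft.fft2(gabor_kernel(theta), s=image.shape)
        # Circular convolution via the FFT; adequate for this toy example.
        response = np.fft.ifft2(spectrum * kernel_spectrum).real
        energies.append(float(np.sum(response**2)))
    return float(thetas[int(np.argmax(energies))]), energies


def synthetic_wrinkles(phi, wavelength=8.0, size=64):
    """Sinusoidal stripes whose intensity varies along angle phi (toy data)."""
    y, x = np.mgrid[0:size, 0:size].astype(float)
    return np.cos(2.0 * np.pi * (x * np.cos(phi) + y * np.sin(phi)) / wavelength)


if __name__ == "__main__":
    theta, _ = dominant_orientation(synthetic_wrinkles(np.pi / 2))
    print(f"dominant orientation: {np.degrees(theta):.0f} degrees")
```

On real cloth images one would typically use more orientations and several wavelengths, and evaluate the response locally rather than over the whole image; those choices are left open here.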

artificial intelligence, operation point, wrinkle, (13 more...)

arXiv:2303.04909

Country: Europe > Netherlands (0.14)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)
