Goto

Collaborating Authors

 conversational robot


Leveraging Large Language Models for Robot-Assisted Learning of Morphological Structures in Preschool Children with Language Vulnerabilities

arXiv.org Artificial Intelligence

Preschool children with language vulnerabilities -- such as developmental language disorders or immigration related language challenges -- often require support to strengthen their expressive language skills. Based on the principle of implicit learning, speech-language therapists (SLTs) typically embed target morphological structures (e.g., third person -s) into everyday interactions or game-based learning activities. Educators are recommended by SLTs to do the same. This approach demands precise linguistic knowledge and real-time production of various morphological forms (e.g., "Daddy wears these when he drives to work"). The task becomes even more demanding when educators or parent also must keep children engaged and manage turn-taking in a game-based activity. In the TalBot project our multiprofessional team have developed an application in which the Furhat conversational robot plays the word retrieval game "Alias" with children to improve language skills. Our application currently employs a large language model (LLM) to manage gameplay, dialogue, affective responses, and turn-taking. Our next step is to further leverage the capacity of LLMs so the robot can generate and deliver specific morphological targets during the game. We hypothesize that a robot could outperform humans at this task. Novel aspects of this approach are that the robot could ultimately serve as a model and tutor for both children and professionals and that using LLM capabilities in this context would support basic communication needs for children with language vulnerabilities. Our long-term goal is to create a robust LLM-based Robot-Assisted Language Learning intervention capable of teaching a variety of morphological structures across different languages.


Interruption Handling for Conversational Robots

arXiv.org Artificial Intelligence

Interruptions, a fundamental component of human communication, can enhance the dynamism and effectiveness of conversations, but only when effectively managed by all parties involved. Despite advancements in robotic systems, state-of-the-art systems still have limited capabilities in handling user-initiated interruptions in real-time. Prior research has primarily focused on post hoc analysis of interruptions. To address this gap, we present a system that detects user-initiated interruptions and manages them in real-time based on the interrupter's intent (i.e., cooperative agreement, cooperative assistance, cooperative clarification, or disruptive interruption). The system was designed based on interaction patterns identified from human-human interaction data. We integrated our system into an LLM-powered social robot and validated its effectiveness through a timed decision-making task and a contentious discussion task with 21 participants. Our system successfully handled 93.69% (n=104/111) of user-initiated interruptions. We discuss our learnings and their implications for designing interruption-handling behaviors in conversational robots.


Dobby: A Conversational Service Robot Driven by GPT-4

arXiv.org Artificial Intelligence

This work introduces a robotics platform which embeds a conversational AI agent in an embodied system for natural language understanding and intelligent decision-making for service tasks; integrating task planning and human-like conversation. The agent is derived from a large language model, which has learned from a vast corpus of general knowledge. In addition to generating dialogue, this agent can interface with the physical world by invoking commands on the robot; seamlessly merging communication and behavior. This system is demonstrated in a free-form tour-guide scenario, in an HRI study combining robots with and without conversational AI capabilities. Performance is measured along five dimensions: overall effectiveness, exploration abilities, scrutinization abilities, receptiveness to personification, and adaptability.


Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners

arXiv.org Artificial Intelligence

As the aging of society continues to accelerate, Alzheimer's Disease (AD) has received more and more attention from not only medical but also other fields, such as computer science, over the past decade. Since speech is considered one of the effective ways to diagnose cognitive decline, AD detection from speech has emerged as a hot topic. Nevertheless, such approaches fail to tackle several key issues: 1) AD is a complex neurocognitive disorder which means it is inappropriate to conduct AD detection using utterance information alone while ignoring dialogue information; 2) Utterances of AD patients contain many disfluencies that affect speech recognition yet are helpful to diagnosis; 3) AD patients tend to speak less, causing dialogue breakdown as the disease progresses. This fact leads to a small number of utterances, which may cause detection bias. Therefore, in this paper, we propose a novel AD detection architecture consisting of two major modules: an ensemble AD detector and a proactive listener. This architecture can be embedded in the dialogue system of conversational robots for healthcare.


Sharing a Laugh: Scientists Teach a Robot When to Have a Sense of Humor - Neuroscience News

#artificialintelligence

Summary: Researchers have designed a shared-laughter AI system that responds to human laughter in order to build a sense of empathy into dialogue. Since at least the time of inquiring minds like Plato, philosophers and scientists have puzzled over the question, "What's so funny?" The Greeks attributed the source of humor to feeling superior at the expense of others. German psychoanalyst Sigmund Freud believed humor was a way to release pent-up energy. US comedian Robin Williams tapped his anger at the absurd to make people laugh.


eXtended Artificial Intelligence: New Prospects of Human-AI Interaction Research

arXiv.org Artificial Intelligence

Artificial Intelligence (AI) covers a broad spectrum of computational problems and use cases. Many of those implicate profound and sometimes intricate questions of how humans interact or should interact with AIs. Moreover, many users or future users do have abstract ideas of what AI is, significantly depending on the specific embodiment of AI applications. Human-centered-design approaches would suggest evaluating the impact of different embodiments on human perception of and interaction with AI. An approach that is difficult to realize due to the sheer complexity of application fields and embodiments in reality. However, here XR opens new possibilities to research human-AI interactions. The article's contribution is twofold: First, it provides a theoretical treatment and model of human-AI interaction based on an XR-AI continuum as a framework for and a perspective of different approaches of XR-AI combinations. It motivates XR-AI combinations as a method to learn about the effects of prospective human-AI interfaces and shows why the combination of XR and AI fruitfully contributes to a valid and systematic investigation of human-AI interactions and interfaces. Second, the article provides two exemplary experiments investigating the aforementioned approach for two distinct AI-systems. The first experiment reveals an interesting gender effect in human-robot interaction, while the second experiment reveals an Eliza effect of a recommender system. Here the article introduces two paradigmatic implementations of the proposed XR testbed for human-AI interactions and interfaces and shows how a valid and systematic investigation can be conducted. In sum, the article opens new perspectives on how XR benefits human-centered AI design and development.


Improvement of humanlike conversations in humanoid robots: Development of a child-like android with the ability to move

#artificialintelligence

Firstly, a multimodal recognition system utilizing the camera, microphone array, etc. was developed. Next, in order to set a technological foundation to facilitate the interaction of the robot with the human, a conversation control system was developed that can control the speech, motion, gaze, and emotion of the robot based on its intention and desire towards making the human feel more human-like existence of the robot during the interaction. Although the experiment for the verification of the system was conducted for a short period of time, including having conversation with a visitor in a waiting room; it has proved that the android "ERICA" is able to conduct natural conversation and increase the perceived existence of the robot by the human, which are less likely to be achieved by using the other well-known robots. Furthermore, by using some novel technologies such as the implementation of natural and various types of nodding during the interaction, asking in return with analyzing the linguistical focus terms of the interaction sentence, and the implementation of the reaction detection mechanism, a conversation system was developed for the robot which has resulted in more human-like sense of conversing. Adopting this system in an experiment in which human participants were asked to have a conversation with the robot, and the human participants were interviewed by an interviewer during and after the experiment, a successful induction of the human was approved in speaking with the robot and continuing a human-like conversation for a long period of time; compared to the well-known smart speaker-based systems.


New advance with conversational robots

#artificialintelligence

The new research stems from a symbiotic human-robot interaction project from the Japan Science and Technology Agency. This is based around the development of a multimodal conversation control system together with a multi-robot conversation control system. These have been designed to create a robot that possess a much higher degree of human-like presence than any comparable robot today. A secondary aim is to create a robot with a'sense of conversing'. This outcome of the project is to design a new generation of conversational robots, starting with a child-like android dubbed'ibuki'.


Exploring Implicit Feedback for Open Domain Conversation Generation

AAAI Conferences

User feedback can be an effective indicator to the success of the human-robot conversation. However, to avoid to interrupt the online real-time conversation process, explicit feedback is usually gained at the end of a conversation. Alternatively, users' responses usually contain their implicit feedback, such as stance, sentiment, emotion, etc., towards the conversation content or the interlocutors. Therefore, exploring the implicit feedback is a natural way to optimize the conversation generation process. In this paper, we propose a novel reward function which explores the implicit feedback to optimize the future reward of a reinforcement learning based neural conversation model. A simulation strategy is applied to explore the state-action space in training and test. Experimental results show that the proposed approach outperforms the Seq2Seq model and the state-of-the-art reinforcement learning model for conversation generation on automatic and human evaluations on the OpenSubtitles and Twitter datasets.