icub
Would you let a humanoid play storytelling with your child? A usability study on LLM-powered narrative Human-Robot Interaction
Lombardi, Maria, Calabrese, Carmela, Ghiglino, Davide, Foglino, Caterina, De Tommaso, Davide, Da Lisca, Giulia, Natale, Lorenzo, Wykowska, Agnieszka
A key challenge in human-robot interaction research lies in developing robotic systems that can effectively perceive and interpret social cues, facilitating natural and adaptive interactions. In this work, we present a novel framework for enhancing the attention of the iCub humanoid robot by integrating advanced perceptual abilities to recognise social cues, understand surroundings through generative models, such as ChatGPT, and respond with contextually appropriate social behaviour. Specifically, we propose an interaction task implementing a narrative protocol (storytelling task) in which the human and the robot create a short imaginary story together, exchanging in turn cubes with creative images placed on them. To validate the protocol and the framework, experiments were performed to quantify the degree of usability and the quality of experience perceived by participants interacting with the system. Such a system can be beneficial in promoting effective human robot collaborations, especially in assistance, education and rehabilitation scenarios where the social awareness and the robot responsiveness play a pivotal role.
Gaze estimation learning architecture as support to affective, social and cognitive studies in natural human-robot interaction
Lombardi, Maria, Maiettini, Elisa, Wykowska, Agnieszka, Natale, Lorenzo
Gaze is a crucial social cue in any interacting scenario and drives many mechanisms of social cognition (joint and shared attention, predicting human intention, coordination tasks). Gaze direction is an indication of social and emotional functions affecting the way the emotions are perceived. Evidence shows that embodied humanoid robots endowing social abilities can be seen as sophisticated stimuli to unravel many mechanisms of human social cognition while increasing engagement and ecological validity. In this context, building a robotic perception system to automatically estimate the human gaze only relying on robot's sensors is still demanding. Main goal of the paper is to propose a learning robotic architecture estimating the human gaze direction in table-top scenarios without any external hardware. Table-top tasks are largely used in many studies in experimental psychology because they are suitable to implement numerous scenarios allowing agents to collaborate while maintaining a face-to-face interaction. Such an architecture can provide a valuable support in studies where external hardware might represent an obstacle to spontaneous human behaviour, especially in environments less controlled than the laboratory (e.g., in clinical settings). A novel dataset was also collected with the humanoid robot iCub, including images annotated from 24 participants in different gaze conditions.
Robots with Different Embodiments Can Express and Influence Carefulness in Object Manipulation
Lastrico, Linda, Garello, Luca, Rea, Francesco, Noceti, Nicoletta, Mastrogiovanni, Fulvio, Sciutti, Alessandra, Carfi, Alessandro
Humans have an extraordinary ability to communicate and read the properties of objects by simply watching them being carried by someone else. This level of communicative skills and interpretation, available to humans, is essential for collaborative robots if they are to interact naturally and effectively. For example, suppose a robot is handing over a fragile object. In that case, the human who receives it should be informed of its fragility in advance, through an immediate and implicit message, i.e., by the direct modulation of the robot's action. This work investigates the perception of object manipulations performed with a communicative intent by two robots with different embodiments (an iCub humanoid robot and a Baxter robot). We designed the robots' movements to communicate carefulness or not during the transportation of objects. We found that not only this feature is correctly perceived by human observers, but it can elicit as well a form of motor adaptation in subsequent human object manipulations. In addition, we get an insight into which motion features may induce to manipulate an object more or less carefully.
iCub Knows Where You Look: Exploiting Social Cues for Interactive Object Detection Learning
Lombardi, Maria, Maiettini, Elisa, Tikhanoff, Vadim, Natale, Lorenzo
Performing joint interaction requires constant mutual monitoring of own actions and their effects on the other's behaviour. Such an action-effect monitoring is boosted by social cues and might result in an increasing sense of agency. Joint actions and joint attention are strictly correlated and both of them contribute to the formation of a precise temporal coordination. In human-robot interaction, the robot's ability to establish joint attention with a human partner and exploit various social cues to react accordingly is a crucial step in creating communicative robots. Along the social component, an effective human-robot interaction can be seen as a new method to improve and make the robot's learning process more natural and robust for a given task. In this work we use different social skills, such as mutual gaze, gaze following, speech and human face recognition, to develop an effective teacher-learner scenario tailored to visual object learning in dynamic environments. Experiments on the iCub robot demonstrate that the system allows the robot to learn new objects through a natural interaction with a human teacher in presence of distractors.
"iCub, We Forgive You!" Investigating Trust in a Game Scenario with Kids
Cocchella, Francesca, Pusceddu, Giulia, Belgiovine, Giulia, Lastrico, Linda, Rea, Francesco, Sciutti, Alessandra
This study presents novel strategies to investigate the mutual influence of trust and group dynamics in children-robot interaction. We implemented a game-like experimental activity with the humanoid robot iCub and designed a questionnaire to assess how the children perceived the interaction. We also aim to verify if the sensors, setups, and tasks are suitable for studying such aspects. The questionnaires' results demonstrate that youths perceive iCub as a friend and, typically, in a positive way. Other preliminary results suggest that, generally, children trusted iCub during the activity and, after its mistakes, they tried to reassure it with sentences such as: "Don't worry iCub, we forgive you". Furthermore, trust towards the robot in group cognitive activity appears to change according to gender: after two consecutive mistakes by the robot, girls tended to trust iCub more than boys. Finally, no significant difference has been evidenced between different age groups across points computed from the game and the self-reported scales. The tool we proposed is suitable for studying trust in human-robot interaction (HRI) across different ages and seems appropriate to understand the mechanism of trust in group interactions.
Shared perception is different from individual perception: a new look on context dependency
Mazzola, Carlo, Rea, Francesco, Sciutti, Alessandra
Human perception is based on unconscious inference, where sensory input integrates with prior information. This phenomenon, known as context dependency, helps in facing the uncertainty of the external world with predictions built upon previous experience. On the other hand, human perceptual processes are inherently shaped by social interactions. However, how the mechanisms of context dependency are affected is to date unknown. If using previous experience - priors - is beneficial in individual settings, it could represent a problem in social scenarios where other agents might not have the same priors, causing a perceptual misalignment on the shared environment. The present study addresses this question. We studied context dependency in an interactive setting with a humanoid robot iCub that acted as a stimuli demonstrator. Participants reproduced the lengths shown by the robot in two conditions: one with iCub behaving socially and another with iCub acting as a mechanical arm. The different behavior of the robot significantly affected the use of prior in perception. Moreover, the social robot positively impacted perceptual performances by enhancing accuracy and reducing participants overall perceptual errors. Finally, the observed phenomenon has been modelled following a Bayesian approach to deepen and explore a new concept of shared perception.
Study shows how people can become quickly convinced that human-like robots can think independently
People quickly become convinced that a human-like robot is capable of independent thoughts and emotions, a new study has found. This occurs when a robot appears to act on its own beliefs and desires, rather than on what it is programmed to do. Researchers from the Italian Institute of Technology probed the response of study participants to an anthropomorphic robot called iCub. The participants completed a questionnaire before and after interacting with iCub, that was programmed to act either like a robot or in a more friendly manner. It was found that those exposed to the robot programmed to act more like a human were more likely to rate the robot's actions as intentional.
Iron Man-style robot is designed to search terrain after natural disasters
An Iron Man-style robot is designed to help out during natural disasters by wading through rubble and using its propulsion backpack to fly over difficult terrain. The robot, called iCub, has been developed by experts at the Istituto Italiano di Tecnologia (IIT) in Genoa, Italy. Robotic systems in iCub's palms will allow it to control power and direction as it zooms through the air using propulsion rockets. The reminiscent of the Iron Man armour worn by Marvel Comics character Tony Stark, played on the big screen by Robert Downey Jr. It can crawl on all fours, walk and sit up to manipulate objects.
When humans play in competition with a humanoid robot, they delay their decisions when the robot looks at them
Gaze is an extremely powerful and important signal during human-human communication and interaction, conveying intentions and informing about other's decisions. What happens when a robot and a human interact looking at each other? Researchers at IIT-Istituto Italiano di Tecnologia (Italian Institute of Technology) investigated whether a humanoid robot's gaze influences the way people reason in a social decision-making context. What they found is that a mutual gaze with a robot affects human neural activity, influencing decision-making processes, in particular delaying them. Thus, a robot gaze brings humans to perceive it as a social signal.
Cognitive architecture aided by working-memory for self-supervised multi-modal humans recognition
Gonzalez-Billandon, Jonas, Belgiovine, Giulia, Sciutti, Alessandra, Sandini, Giulio, Rea, Francesco
The ability to recognize human partners is an important social skill to build personalized and long-term human-robot interactions, especially in scenarios like education, care-giving, and rehabilitation. Faces and voices constitute two important sources of information to enable artificial systems to reliably recognize individuals. Deep learning networks have achieved state-of-the-art results and demonstrated to be suitable tools to address such a task. However, when those networks are applied to different and unprecedented scenarios not included in the training set, they can suffer a drop in performance. For example, with robotic platforms in ever-changing and realistic environments, where always new sensory evidence is acquired, the performance of those models degrades. One solution is to make robots learn from their first-hand sensory data with self-supervision. This allows coping with the inherent variability of the data gathered in realistic and interactive contexts. To this aim, we propose a cognitive architecture integrating low-level perceptual processes with a spatial working memory mechanism. The architecture autonomously organizes the robot's sensory experience into a structured dataset suitable for human recognition. Our results demonstrate the effectiveness of our architecture and show that it is a promising solution in the quest of making robots more autonomous in their learning process.