Body Language


Classification of User Satisfaction in HRI with Social Signals in the Wild

Schiffmann, Michael, Jeschke, Sabina, Richert, Anja

arXiv.org Artificial Intelligence

Socially interactive agents (SIAs) are being used in various scenarios and are nearing productive deployment. Evaluating user satisfaction with SIAs' performance is a key factor in designing the interaction between the user and the SIA. Currently, subjective user satisfaction is primarily assessed manually through questionnaires or indirectly via system metrics. This study examines the automatic classification of user satisfaction through analysis of social signals, aiming to enhance both manual and autonomous evaluation methods for SIAs. During a field trial at the Deutsches Museum Bonn, a Furhat Robotics head was employed as a service and information hub, collecting an "in-the-wild" dataset. This dataset comprises 46 single-user interactions, including questionnaire responses and video data. Our method focuses on automatically classifying user satisfaction based on time series classification, using time series of social-signal metrics derived from body pose, facial expressions, and physical distance. This study compares three feature engineering approaches across different machine learning models. The results confirm the method's effectiveness in reliably identifying interactions with low user satisfaction without the need for manually annotated datasets. This approach offers significant potential for enhancing SIA performance and user experience through automated feedback mechanisms.
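The general pipeline described here (per-interaction time series of social-signal metrics, feature engineering, a standard classifier) can be illustrated with a minimal sketch. The summary-statistic features, the random-forest model, and the synthetic data below are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (not the authors' exact pipeline): flag low-satisfaction interactions
# from per-interaction time series of social-signal metrics using simple
# summary-statistic features and a standard scikit-learn classifier.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

def summarize(series: np.ndarray) -> np.ndarray:
    """Collapse one (T, n_signals) time series into fixed-size per-signal features."""
    return np.concatenate([
        series.mean(axis=0),      # average level of each signal
        series.std(axis=0),       # variability over the interaction
        np.ptp(series, axis=0),   # range (max - min)
    ])

def build_dataset(interactions, labels):
    """interactions: list of (T_i, n_signals) arrays (pose, expression, distance metrics);
    labels: 1 = low satisfaction, 0 = otherwise (hypothetical encoding)."""
    X = np.stack([summarize(ts) for ts in interactions])
    y = np.asarray(labels)
    return X, y

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-in for the 46 field-trial interactions.
    interactions = [rng.normal(size=(rng.integers(50, 200), 5)) for _ in range(46)]
    labels = rng.integers(0, 2, size=46)
    X, y = build_dataset(interactions, labels)
    clf = RandomForestClassifier(n_estimators=200, random_state=0)
    print("CV accuracy:", cross_val_score(clf, X, y, cv=5).mean())
```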


AI's Impact on Mental Health

Communications of the ACM

There is no doubt artificial intelligence (AI) has the potential to improve access to mental health care. "One could imagine a world where AI serves as the 'front line' for mental health, providing a clearinghouse of resources and available services for individuals seeking help," wrote the authors of the 2023 article "The Potential Influence of AI on Population Mental Health." Targeted interventions delivered digitally through chatbots "can help reduce the population burden of mental illness, particularly in hard-to-reach populations and contexts, for example, through stepped care approaches that aim to help populations with the highest risk following natural disasters," the article states. Besides Nomi, there are an increasing number of AI platforms people are using to create chatbots to take on several roles, including that of ad hoc therapist. Yet, while AI can assist in mental health management, it cannot replace human intuition. A trained therapist observes nuances that AI can't, such as body language, tone shifts, and unspoken emotions. Chatbots can be helpful, but mental health experts stress that they should never fully replace the human experience. That said, these mainstream chatbots are frequently being used for therapeutic purposes, as opposed to chatbots designed with mental health management in mind. Industry observers say the reasons are many: They provide emotional support when people are not ready to reach out to a therapist. They are anonymous, easy to use, convenient, available anytime, safe, judgment-free, affordable, and fast. These general-purpose chatbots help by providing comfort, validation, and a safe space for users to express themselves--all without the stigma that sometimes comes with traditional therapy settings. "Talking to a therapist can be intimidating, expensive, or complicated to access, and sometimes you need someone--or something--to listen at that exact moment," said Stephanie Lewis, a licensed clinical social worker and executive director of Epiphany Wellness addiction and mental health treatment centers.


How Age Influences the Interpretation of Emotional Body Language in Humanoid Robots -- long paper version

Consoli, Ilaria, Mattutino, Claudio, Gena, Cristina, de Carolis, Berardina, Palestra, Giuseppe

arXiv.org Artificial Intelligence

There is a general consensus that body movements and postures provide important cues for identifying emotional states, particularly when facial and vocal signals are unavailable [1]. Emotional Body Language (EBL) is rapidly emerging as a significant area of research within cognitive and affective neuroscience. According to De Gelder [10], numerous valuable insights into human emotion and its neurobiological foundations have been derived from the study of facial expressions. Indeed, certain emotions are more effectively conveyed through facial expressions, while others are better communicated through body movements or a combination of both. Gestures provide observable cues that can be instrumental in recognizing and interpreting a user's emotional state, especially in the absence of verbal or facial signals.


Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues

Kim, Youngmin, Chung, Jiwan, Kim, Jisoo, Lee, Sunghyun, Lee, Sangkyu, Kim, Junhyeok, Yang, Cheoljong, Yu, Youngjae

arXiv.org Artificial Intelligence

Nonverbal communication is integral to human interaction, with gestures, facial expressions, and body language conveying critical aspects of intent and emotion. However, existing large language models (LLMs) fail to effectively incorporate these nonverbal elements, limiting their capacity to create fully immersive conversational experiences. We introduce MARS, a multimodal language model designed to understand and generate nonverbal cues alongside text, bridging this gap in conversational AI. Our key innovation is VENUS, a large-scale dataset comprising annotated videos with time-aligned text, facial expressions, and body language. Leveraging VENUS, we train MARS with a next-token prediction objective, combining text with vector-quantized nonverbal representations to achieve multimodal understanding and generation within a unified framework. Based on various analyses of the VENUS dataset, we validate its substantial scale and effectiveness. Our quantitative and qualitative results demonstrate that MARS successfully generates text and nonverbal cues that correspond to the conversational input.
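The core idea of combining text with vector-quantized nonverbal representations under one next-token objective can be sketched roughly as follows; the codebook, vocabulary sizes, and feature dimensions are illustrative assumptions, not the MARS/VENUS implementation.

```python
# Hedged sketch of the general idea: quantize continuous nonverbal features against a
# codebook and interleave the resulting discrete codes with text token IDs so a single
# next-token objective can cover both modalities.
import numpy as np

VOCAB_TEXT = 32_000      # hypothetical text vocabulary size
CODEBOOK_SIZE = 512      # hypothetical nonverbal codebook size

def vector_quantize(features: np.ndarray, codebook: np.ndarray) -> np.ndarray:
    """Map each feature frame (e.g. a facial-expression or pose vector) to the
    index of its nearest codebook entry (standard VQ assignment step)."""
    dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    return dists.argmin(axis=1)

def interleave(text_ids: np.ndarray, nonverbal_codes: np.ndarray) -> np.ndarray:
    """Place nonverbal codes in a disjoint ID range above the text vocabulary and
    append them to the utterance, yielding one token stream for LM training."""
    return np.concatenate([text_ids, nonverbal_codes + VOCAB_TEXT])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    codebook = rng.normal(size=(CODEBOOK_SIZE, 64))   # illustrative learned codebook
    frames = rng.normal(size=(12, 64))                # 12 time-aligned nonverbal frames
    text_ids = rng.integers(0, VOCAB_TEXT, size=20)   # tokenized utterance
    seq = interleave(text_ids, vector_quantize(frames, codebook))
    print(seq.shape, seq.min(), seq.max())            # targets for next-token prediction
```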


Augmented Body Communicator: Enhancing daily body expression for people with upper limb limitations through LLM and a robotic arm

Zhou, Songchen, Armstrong, Mark, Barbareschi, Giulia, Ajioka, Toshihiro, Hu, Zheng, Ando, Ryoichi, Yoshifuji, Kentaro, Muto, Masatane, Minamizawa, Kouta

arXiv.org Artificial Intelligence

Individuals with upper limb movement limitations face challenges in interacting with others. Although robotic arms are currently used primarily for functional tasks, there is considerable potential to explore ways to enhance users' body language capabilities during social interactions. This paper introduces an Augmented Body Communicator system that integrates robotic arms and a large language model. Through the incorporation of kinetic memory, disabled users and their supporters can collaboratively design actions for the robot arm. The LLM system then suggests the most suitable action based on contextual cues during interactions. The system underwent thorough user testing with six participants who have conditions affecting upper limb mobility. Results indicate that the system improves users' ability to express themselves. Based on our findings, we offer recommendations for developing robotic arms that support disabled individuals with both body language expression and functional tasks.
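The selection step (an LLM choosing a pre-recorded gesture from the kinetic memory given conversational context) might look roughly like the sketch below; the gesture library, prompt wording, and query_llm placeholder are assumptions for illustration, not the paper's system.

```python
# Illustrative sketch only: select a user-authored robot-arm gesture from a
# "kinetic memory" library based on the current conversational context.
from dataclasses import dataclass

@dataclass
class Gesture:
    name: str
    description: str   # natural-language description authored by the user/supporter

KINETIC_MEMORY = [
    Gesture("wave_hello", "raise the arm and wave side to side in greeting"),
    Gesture("thumbs_up", "rotate the wrist and extend the thumb to signal agreement"),
    Gesture("point_screen", "extend the arm toward the shared screen"),
]

def query_llm(prompt: str) -> str:
    """Placeholder: a real deployment would call an LLM here and parse its choice."""
    return KINETIC_MEMORY[0].name   # trivial stand-in so the sketch runs

def suggest_gesture(context: str) -> Gesture:
    options = "\n".join(f"- {g.name}: {g.description}" for g in KINETIC_MEMORY)
    prompt = (
        "Conversation context:\n"
        f"{context}\n\n"
        "Choose the single most suitable gesture from:\n"
        f"{options}\n"
        "Answer with the gesture name only."
    )
    choice = query_llm(prompt)
    return next(g for g in KINETIC_MEMORY if g.name == choice)

if __name__ == "__main__":
    print(suggest_gesture("A friend just said hello as they entered the room.").name)
```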


'It's a new world': the analysts using AI to psychologically profile elite players

The Guardian

Listen to any pundit's post-match reaction and you will hear variations of that soundbite. But can you analyse an athlete's state of mind, based on their on-pitch body language? In an era when football is increasingly leaning on data to demonstrate physical attributes, statistics offering an accurate indication of a player's psychological qualities, such as emotional control and leadership, are harder to come by. But Premier League clubs including Brighton are using a technique intended to help in that regard with selection and recruitment. Thomas Tuchel made headlines by telling England's players to communicate more after he evaluated their interactions during the final of Euro 2024, but counting the number of times players gesture or talk to each other on the pitch tells only part of the mental battle being played out.


AI avatar generator Synthesia does video footage deal with Shutterstock

The Guardian

A $2bn (£1.6bn) British startup that uses artificial intelligence to generate realistic avatars has struck a licensing deal with the stock footage firm Shutterstock to help develop its technology. Synthesia will pay the US-based Shutterstock an undisclosed sum to use its library of corporate video footage to train its latest AI model. It expects that incorporating the clips into its model will produce even more realistic expressions, vocal tones and body language from the avatars. "Thanks to this partnership with Shutterstock, we hope to try out new approaches that will … increase the realism and expressiveness of our AI generated avatars, bringing them closer to human-like performances," said Synthesia. Synthesia uses human actors to generate digital avatars of people, which are then deployed by companies in corporate videos in a range of scenarios such as advising on cybersecurity, calculating water bills and how to communicate better at work.


BeMERC: Behavior-Aware MLLM-based Framework for Multimodal Emotion Recognition in Conversation

Fu, Yumeng, Wu, Junjie, Wang, Zhongjie, Zhang, Meishan, Wu, Yulin, Liu, Bingquan

arXiv.org Artificial Intelligence

Multimodal emotion recognition in conversation (MERC), the task of identifying the emotion label for each utterance in a conversation, is vital for developing empathetic machines. Current MLLM-based MERC studies focus mainly on capturing the speaker's textual or vocal characteristics, but ignore the significance of video-derived behavior information. Unlike text and audio inputs, video rich in facial expressions, body language, and posture provides emotion-trigger signals that help models make more accurate emotion predictions. In this paper, we propose a novel behavior-aware MLLM-based framework (BeMERC) to incorporate the speaker's behaviors, including subtle facial micro-expressions, body language, and posture, into a vanilla MLLM-based MERC model, thereby facilitating the modeling of emotional dynamics during a conversation. Furthermore, BeMERC adopts a two-stage instruction tuning strategy to extend the model to the conversation scenario for end-to-end training of a MERC predictor. Experiments demonstrate that BeMERC outperforms state-of-the-art methods on two benchmark datasets, and we provide a detailed discussion of the significance of video-derived behavior information in MERC.
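One way to picture the behavior-aware instruction tuning is a training example that pairs each utterance with a video-derived behavior caption and targets the emotion label. The field names, emotion set, and prompt wording below are hypothetical, not BeMERC's actual data format.

```python
# Hedged illustration: format one conversation turn as an instruction-tuning example
# that combines the utterance with a behavior caption, targeting the emotion label.
EMOTIONS = ["neutral", "joy", "sadness", "anger", "surprise", "fear", "disgust"]

def build_example(utterance: str, behavior_caption: str, label: str) -> dict:
    assert label in EMOTIONS
    instruction = (
        "You are given one utterance from a conversation together with a description "
        "of the speaker's facial micro-expressions, body language and posture. "
        f"Predict the speaker's emotion from: {', '.join(EMOTIONS)}."
    )
    model_input = f"Utterance: {utterance}\nObserved behavior: {behavior_caption}"
    return {"instruction": instruction, "input": model_input, "output": label}

if __name__ == "__main__":
    ex = build_example(
        "I can't believe you did that without telling me.",
        "leans forward, jaw tightens, arms cross over the chest",
        "anger",
    )
    print(ex["input"], "->", ex["output"])
```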


SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Jiang, Jianping, Xiao, Weiye, Lin, Zhengyu, Zhang, Huaizhong, Ren, Tianxiang, Gao, Yang, Lin, Zhiqian, Cai, Zhongang, Yang, Lei, Liu, Ziwei

arXiv.org Artificial Intelligence

Human beings are social animals. How to equip 3D autonomous characters with similar social intelligence that can perceive, understand, and interact with humans remains an open yet fundamental problem. In this paper, we introduce SOLAMI, the first end-to-end Social vision-Language-Action (VLA) Modeling framework for Immersive interaction with 3D autonomous characters. Specifically, SOLAMI builds 3D autonomous characters from three aspects: (1) Social VLA Architecture: We propose a unified social VLA framework to generate multimodal responses (speech and motion) based on the user's multimodal input to drive the character for social interaction. (2) Interactive Multimodal Data: We present SynMSI, a synthetic multimodal social interaction dataset generated by an automatic pipeline using only existing motion datasets to address the issue of data scarcity. (3) Immersive VR Interface: We develop a VR interface that enables users to immersively interact with these characters driven by various architectures. Extensive quantitative experiments and user studies demonstrate that our framework leads to more precise and natural character responses (in both speech and motion) that align with user expectations with lower latency.
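At the interface level, the unified social VLA idea is a single model mapping the user's multimodal turn to a multimodal character response. The types, shapes, and stub response below are illustrative assumptions only, not SOLAMI's API.

```python
# Rough interface sketch under assumptions: one function maps the user's multimodal
# input (speech + body motion) to a unified multimodal response (speech + motion).
from dataclasses import dataclass
from typing import List

@dataclass
class MultimodalTurn:
    speech_tokens: List[int]           # tokenized speech for the turn
    motion_frames: List[List[float]]   # per-frame body-motion features (illustrative)

def social_vla_respond(user_turn: MultimodalTurn) -> MultimodalTurn:
    """Stand-in for the end-to-end model: returns fixed placeholder speech and an
    idle motion. A real model would decode both streams from the user input."""
    reply_speech = [101, 2054, 2024]                # placeholder token IDs
    idle_motion = [[0.0] * 6 for _ in range(30)]    # 30 frames of a neutral pose
    return MultimodalTurn(reply_speech, idle_motion)

if __name__ == "__main__":
    user = MultimodalTurn([7592], [[0.1] * 6 for _ in range(30)])
    reply = social_vla_respond(user)
    print(len(reply.speech_tokens), len(reply.motion_frames))
```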


Love in Action: Gamifying Public Video Cameras for Fostering Social Relationships in Real World

Zhang, Zhang, Li, Da, Wu, Geng, Li, Yaoning, Sun, Xiaobing, Wang, Liang

arXiv.org Artificial Intelligence

In this paper, we create "Love in Action" (LIA), a body-language-based social game that uses video cameras installed in public spaces to enhance social relationships in the real world. In the game, participants assume dual roles: requesters, who issue social requests, and performers, who respond to those requests by performing specified body language. To mediate communication between participants, we build an AI-enhanced video analysis system incorporating multiple visual analysis modules, such as person detection, attribute recognition, and action recognition, to assess the performer's body-language quality. A two-week field study involving 27 participants shows significant improvements in their social friendships, as indicated by self-reported questionnaires. Moreover, user experiences are investigated to highlight the potential of public video cameras as a novel communication medium for socializing in public spaces.
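The assessment pipeline (person detection, attribute recognition, action recognition feeding a body-language quality score) can be outlined roughly as below; the module stubs and frame-fraction scoring rule are assumptions for illustration, not the paper's implementation.

```python
# Hedged sketch of the general pipeline idea: chain detection, attribute recognition
# and action recognition over camera frames, then score whether the requested body
# language was performed.
from dataclasses import dataclass
from typing import List

@dataclass
class Detection:
    bbox: tuple        # (x1, y1, x2, y2) in pixels
    attributes: dict   # e.g. {"upper_color": "red"} from attribute recognition
    action: str        # label from the action-recognition module

def detect_people(frame) -> List[Detection]:
    """Placeholder for real person-detection, attribute, and action-recognition models."""
    return [Detection((10, 20, 110, 300), {"upper_color": "red"}, "waving")]

def score_performance(frames, requested_action: str) -> float:
    """Fraction of frames in which some detected person performs the requested action."""
    hits = 0
    for frame in frames:
        if any(d.action == requested_action for d in detect_people(frame)):
            hits += 1
    return hits / max(len(frames), 1)

if __name__ == "__main__":
    fake_frames = [object()] * 30   # stand-in for 30 video frames
    print("body-language quality:", score_performance(fake_frames, "waving"))
```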