Large Language Models for Virtual Human Gesture Selection

Torshizi, Parisa Ghanad, Hensel, Laura B., Shapiro, Ari, Marsella, Stacy C.

Mar-18-2025–arXiv.org Artificial Intelligence

Co-speech gestures convey a wide variety of meanings and play an important role in face-to-face human interactions. These gestures significantly influence the addressee's engagement, recall, comprehension, and attitudes toward the speaker. Similarly, they impact interactions between humans and embodied virtual agents. The process of selecting and animating meaningful gestures has thus become a key focus in the design of these agents. However, automating this gesture selection process poses a significant challenge. Prior gesture generation techniques have varied from fully automated, data-driven methods, which often struggle to produce contextually meaningful gestures, to more manual approaches that require crafting specific gesture expertise and are time-consuming and lack generalizability. In this paper, we leverage the semantic capabilities of Large Language Models to develop a gesture selection approach that suggests meaningful, appropriate co-speech gestures. We first describe how information on gestures is encoded into GPT-4. Then, we conduct a study to evaluate alternative prompting approaches for their ability to select meaningful, contextually relevant gestures and to align them appropriately with the co-speech utterance. Finally, we detail and demonstrate how this approach has been implemented within a virtual agent system, automating the selection and subsequent animation of the selected gestures for enhanced human-agent interactions.

large language model, machine learning, utterance, (20 more...)

arXiv.org Artificial Intelligence

Mar-18-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Michigan > Wayne County
    - Detroit (0.05)
  - Massachusetts > Suffolk County
    - Boston (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
  - California
    - Los Angeles County > Los Angeles (0.14)
    - Orange County > Anaheim (0.04)
- Europe > United Kingdom
  - Scotland > City of Glasgow
    - Glasgow (0.04)
  - England > Cambridgeshire
    - Cambridge (0.04)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Health & Medicine (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found