Taylor, Sarah
LLAniMAtion: LLAMA Driven Gesture Animation
Windle, Jonathan, Matthews, Iain, Taylor, Sarah
Co-speech gesturing is an important modality in conversation, providing context and social cues. In character animation, appropriate and synchronised gestures add realism, and can make interactive agents more engaging. Historically, methods for automatically generating gestures were predominantly audio-driven, exploiting the prosodic and speech-related content that is encoded in the audio signal. In this paper we instead experiment with using LLM features for gesture generation that are extracted from text using LLAMA2. We compare against audio features, and explore combining the two modalities in both objective tests and a user study. Surprisingly, our results show that LLAMA2 features on their own perform significantly better than audio features and that including both modalities yields no significant difference to using LLAMA2 features in isolation. We demonstrate that the LLAMA2 based model can generate both beat and semantic gestures without any audio input, suggesting LLMs can provide rich encodings that are well suited for gesture generation.
Modeling Socio-Cultural Phenomena in Online Multi-Party Discourse
Strzalkowski, Tomek (State University of New York - Albany and Polish Academy of Sciences) | Broadwell, George Aaron (State University of New York - Albany) | Stromer-Galley, Jennifer ( State University of New York - Albany ) | Shaikh, Samira (State University of New York - Albany) | Liu, Ting (State University of New York - Albany) | Taylor, Sarah (Lockheed Martin)
We present in this paper, the application of a novel approach to computational modeling, understanding and detection of social phenomena in online multi-party discourse. A two-tiered approach was developed to detect a collection of social phenomena deployed by participants, such as topic control, task control, disagreement and involvement. We discuss how the mid-level social phenomena can be reliably detected in discourse and these measures can be used to differentiate participants of online discourse. Our approach works across different types of online chat and we show results on two specific data sets.