Generating Signed Language Instructions in Large-Scale Dialogue Systems

İnan, Mert; Atwell, Katherine; Sicilia, Anthony; Quandt, Lorna; Alikhani, Malihe

arXiv.org Artificial Intelligence 

We introduce a goal-oriented conversational AI system enhanced with American Sign Language (ASL) instructions, presenting the first implementation of such a system on a worldwide multimodal conversational AI platform. Accessible through a touch-based interface, our system receives input from users and seamlessly generates ASL instructions by leveraging retrieval methods and cognitively based gloss translations. Central to our design is a sign translation module powered by Large Language Models, alongside a token-based video retrieval system for delivering instructional content from recipes and wikiHow guides. Our development process is deeply rooted in a commitment to community engagement, incorporating insights from the Deaf and Hard-of-Hearing community, as well as experts in cognitive and ASL learning sciences. The effectiveness of our signing instructions is validated by user feedback, achieving ratings on par with those of the system in its non-signing variant. Additionally, our system demonstrates exceptional performance in retrieval accuracy and text-generation quality, measured by metrics such as BERTScore.

Figure 1: An overview of our multimodal dialogue system, capable of giving signed instructions to Deaf or Hard-of-Hearing users in ASL. We first translate task instructions to an intermediate textual representation
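As a rough illustration of the token-based video retrieval idea mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes that an instruction has already been translated to a whitespace-separated gloss string and that each gloss token maps to at most one pre-recorded clip. The index contents, file paths, and the `retrieve_clips` helper are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical gloss-to-clip index. The tokens and file names are
# invented for illustration and do not come from the paper.
VIDEO_INDEX: Dict[str, str] = {
    "BOIL": "clips/boil.mp4",
    "WATER": "clips/water.mp4",
    "ADD": "clips/add.mp4",
}


def retrieve_clips(gloss: str, index: Dict[str, str]) -> Tuple[List[str], List[str]]:
    """Look up one clip per gloss token.

    Returns a pair: the clip paths found, in gloss order, and the
    tokens for which the index has no clip.
    """
    clips: List[str] = []
    missing: List[str] = []
    for token in gloss.upper().split():
        path = index.get(token)
        if path is None:
            missing.append(token)  # no clip available for this token
        else:
            clips.append(path)
    return clips, missing


# Example: a gloss string with one out-of-vocabulary token.
clips, missing = retrieve_clips("WATER BOIL ADD SALT", VIDEO_INDEX)
```

Returning the unmatched tokens separately leaves the handling of out-of-vocabulary glosses to the caller. How the actual system treats such tokens is not stated in the abstract.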