ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation
Yang, Boyin, Jiang, Puming, Kristensson, Per Ola
–arXiv.org Artificial Intelligence
People living with Motor Neuron Disease (plwMND) frequently encounter speech and motor impairments that necessitate a reliance on augmentative and alternative communication (AAC) systems. This paper tackles the main challenge that traditional symbol-based AAC systems offer a limited vocabulary, while text entry solutions tend to exhibit low communication rates. To help plwMND articulate their needs about the system efficiently and effectively, we iteratively design and develop a novel multimodal text generation system called ImageTalk through a tailored proxy-user-based and an end-user-based design phase. The system demonstrates pronounced keystroke savings of 95.6%, coupled with consistent performance and high user satisfaction. We distill three design guidelines for AI-assisted text generation systems design and outline four user requirement levels tailored for AAC purposes, guiding future research in this field.
arXiv.org Artificial Intelligence
Dec-11-2025
- Country:
- Asia > Japan
- Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.04)
- Europe
- Germany > Hamburg (0.04)
- Greece > Attica
- Athens (0.04)
- United Kingdom
- England
- Cambridgeshire > Cambridge (0.14)
- Greater London > London (0.04)
- Scotland
- City of Dundee > Dundee (0.04)
- City of Edinburgh > Edinburgh (0.04)
- England
- North America > United States
- New York > New York County
- New York City (0.05)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia > Japan
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.86)
- Technology: