Predictive Authoring for Brazilian Portuguese Augmentative and Alternative Communication
Pereira, Jayr, Nogueira, Rodrigo, Zanchettin, Cleber, Fidalgo, Robson
–arXiv.org Artificial Intelligence
Individuals with complex communication needs (CCN) often rely on augmentative and alternative communication (AAC) systems to have conversations and communique their wants. Such systems allow message authoring by arranging pictograms in sequence. However, the difficulty of finding the desired item to complete a sentence can increase as the user's vocabulary increases. This paper proposes using BERTimbau, a Brazilian Portuguese version of BERT, for pictogram prediction in AAC systems. To finetune BERTimbau, we constructed an AAC corpus for Brazilian Portuguese to use as a training corpus. We tested different approaches to representing a pictogram for prediction: as a word (using pictogram captions), as a concept (using a dictionary definition), and as a set of synonyms (using related terms). We also evaluated the usage of images for pictogram prediction. The results demonstrate that using embeddings computed from the pictograms' caption, synonyms, or definitions have a similar performance. Using synonyms leads to lower perplexity, but using captions leads to the highest accuracies. This paper provides insight into how to represent a pictogram for prediction using a BERT-like model and the potential of using images for pictogram prediction.
arXiv.org Artificial Intelligence
Aug-18-2023
- Country:
- South America
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Brazil
- Rio Grande do Sul > Porto Alegre (0.04)
- Pernambuco > Recife (0.04)
- Chile > Santiago Metropolitan Region
- North America > United States
- Ohio > Wayne County
- Wooster (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Ohio > Wayne County
- Europe
- Asia
- China (0.04)
- Middle East > Republic of Türkiye
- Batman Province > Batman (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- South America
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Technology:
- Information Technology
- Data Science (1.00)
- Communications (1.00)
- Artificial Intelligence
- Cognitive Science (1.00)
- Representation & Reasoning > Expert Systems (0.67)
- Natural Language
- Large Language Model (1.00)
- Chatbot (0.69)
- Machine Translation (0.67)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology