Neural Variational Learning for Grounded Language Acquisition

Pillai, Nisha, Matuszek, Cynthia, Ferraro, Francis

Jul-20-2021–arXiv.org Artificial Intelligence

We propose a learning system in which language is grounded in visual percepts without specific pre-defined categories of terms. We present a unified generative method to acquire a shared semantic/visual embedding that enables the learning of language about a wide range of real-world objects. We evaluate the efficacy of this learning by predicting the semantics of objects and comparing the performance with neural and non-neural inputs. We show that this generative approach exhibits promising results in language grounding without pre-specifying visual categories under low resource settings. Our experiments demonstrate that this approach is generalizable to multilingual, highly varied datasets.

baseline, classification, classifier, (15 more...)

arXiv.org Artificial Intelligence

Jul-20-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States > Maryland
  - Baltimore (0.14)
  - Baltimore County (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.46)
    - Statistical Learning > Regression (0.30)