Low-resource Neural Machine Translation with Cross-modal Alignment
Yang, Zhe; Fang, Qingkai; Feng, Yang
arXiv.org Artificial Intelligence
How can neural machine translation be achieved with limited parallel data? Existing techniques often rely on large-scale monolingual corpora, which is impractical for some low-resource languages. In this paper, we instead connect several low-resource languages to a particular high-resource one through an additional visual modality. Specifically, we propose a cross-modal contrastive learning method that learns a shared space for all languages, introducing both a coarse-grained sentence-level objective and a fine-grained token-level one. Experimental results and further analysis show that our method effectively learns cross-modal and cross-lingual alignment from a small number of image-text pairs and achieves significant improvements over the text-only baseline in both zero-shot and few-shot scenarios.
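To make the sentence-level objective concrete, the following is a minimal sketch of a contrastive loss over paired sentence and image embeddings. The abstract does not give the loss formulation, so the InfoNCE-style form, the cosine similarity, the temperature value, and the names `cosine` and `sentence_contrastive_loss` are all assumptions made for illustration rather than the authors' implementation. The token-level objective is not covered here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def sentence_contrastive_loss(text_embs, image_embs, temperature=0.1):
    """Assumed InfoNCE-style loss (hypothetical helper, not from the paper).

    text_embs[i] and image_embs[i] form a positive pair; every other image
    in the batch is treated as a negative for sentence i.
    """
    total = 0.0
    for i, text in enumerate(text_embs):
        logits = [cosine(text, image) / temperature for image in image_embs]
        # Log-sum-exp with max subtraction for numerical stability.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
        # Negative log-probability of picking the paired image.
        total += log_denominator - logits[i]
    return total / len(text_embs)


# Toy batch: two sentences and two images in a 2-dimensional shared space.
sentences = [[1.0, 0.0], [0.0, 1.0]]
matched_images = [[1.0, 0.0], [0.0, 1.0]]
mismatched_images = [[0.0, 1.0], [1.0, 0.0]]

loss_matched = sentence_contrastive_loss(sentences, matched_images)
loss_mismatched = sentence_contrastive_loss(sentences, mismatched_images)
```

In this toy batch the loss is close to zero when each sentence sits next to its own image and large when the pairs are swapped, which is the behavior a shared cross-modal space relies on.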
Oct-13-2022
- Genre:
- Research Report (0.50)
- Technology: