Extended Parallel Corpus for Amharic-English Machine Translation
Gezmu, Andargachew Mekonnen, Nürnberger, Andreas, Bati, Tesfaye Bayu
–arXiv.org Artificial Intelligence
This paper describes the acquisition, preprocessing, segmentation, and alignment of an Amharic-English parallel corpus. It will be useful for machine translation of an under-resourced language, Amharic. The corpus is larger than previously compiled corpora; it is released for research purposes. We trained neural machine translation and phrase-based statistical machine translation models using the corpus. In the automatic evaluation, neural machine translation models outperform phrase-based statistical machine translation models.
arXiv.org Artificial Intelligence
Apr-8-2021
- Country:
- South America > Paraguay
- North America
- United States
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Michigan > Wayne County
- Detroit (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- California > San Diego County
- San Diego (0.04)
- Pennsylvania > Philadelphia County
- Canada
- United States
- Europe
- Czechia > Prague (0.04)
- Germany
- Berlin (0.05)
- Saxony-Anhalt > Magdeburg (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Greece > Attica
- Athens (0.04)
- Italy
- Tuscany > Florence (0.04)
- Trentino-Alto Adige/Südtirol > Trentino Province
- Trento (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Asia
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Republic of Türkiye > Istanbul Province
- Japan
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Hokkaidō > Hokkaidō Prefecture
- Sapporo (0.04)
- Kyūshū & Okinawa > Kyūshū
- Middle East
- Africa
- Genre:
- Research Report (0.40)
- Technology: