A baseline revisited: Pushing the limits of multi-segment models for context-aware translation
Majumder, Suvodeep, Lauly, Stanislas, Nadejde, Maria, Federico, Marcello, Dinu, Georgiana
–arXiv.org Artificial Intelligence
We show that multi-sentence translation can The quality of NMT (Neural Machine Translation) benefit from increased-capacity transformer models has been improving over the years and models and that deeper models are better at is narrowing the gap to human translation performance learning contextual dependencies than wider (Hassan et al., 2018). Until recently, most models. of the MT research has focused on translating and evaluating sentences in isolation, ignoring the context We further show that distilled models can in which these sentences occur. Simplifying learn contextual dependencies from larger the translation task this way has its advantages: models, while reducing computational cost data sets are easier to create, models are computationally and increasing robustness to input length variations.
arXiv.org Artificial Intelligence
Oct-21-2022
- Country:
- Oceania > Fiji (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Texas (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Germany > Berlin (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy
- Tuscany > Florence (0.04)
- Trentino-Alto Adige/Südtirol > Trentino Province
- Trento (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Research Report (1.00)
- Technology: