Evaluating Machine Translation Quality with Conformal Predictive Distributions
–arXiv.org Artificial Intelligence
This paper presents a new approach for assessing uncertainty in machine translation by simultaneously evaluating translation quality and providing a reliable confidence score. Our approach utilizes conformal predictive distributions to produce prediction intervals with guaranteed coverage, meaning that for any given significance level $\epsilon$, we can expect the true quality score of a translation to fall out of the interval at a rate of $1-\epsilon$. In this paper, we demonstrate how our method outperforms a simple, but effective baseline on six different language pairs in terms of coverage and sharpness. Furthermore, we validate that our approach requires the data exchangeability assumption to hold for optimal performance.
arXiv.org Artificial Intelligence
Jun-2-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Minnesota > Hennepin County
- Canada > Quebec
- Montreal (0.04)
- Europe
- Germany > Berlin (0.04)
- United Kingdom > England
- Surrey (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Asia > China
- Hong Kong (0.04)
- North America
- Genre:
- Research Report > New Finding (0.46)
- Technology: