Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

Open in new window