Optimizing Preference Alignment with Differentiable NDCG Ranking
