Distributional Preference Alignment of LLMs via Optimal Transport

May-27-2025, 14:44:08 GMT–Neural Information Processing Systems

Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for distributional preference alignment of LLMs. We introduce a convex relaxation of this first-order stochastic dominance and cast it as an optimal transport problem with a smooth and convex cost. Thanks to the one-dimensional nature of the resulting optimal transport problem and the convexity of the cost, it has a closed-form solution via sorting on empirical measures. We fine-tune LLMs with this AOT objective, which enables alignment by penalizing the violation of the stochastic dominance of the reward distribution of the positive samples on the reward distribution of the negative samples.

artificial intelligence, large language model, natural language, (7 more...)

Neural Information Processing Systems

May-27-2025, 14:44:08 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report > Promising Solution (0.41)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)