Distributional Preference Alignment of LLMs via Optimal Transport

Mar-22-2026, 07:15:51 GMT–Neural Information Processing Systems

Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for distributional preference alignment of LLMs.

artificial intelligence, large language model, natural language, (9 more...)

Neural Information Processing Systems

Mar-22-2026, 07:15:51 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report > Promising Solution (0.39)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.63)