Distributional Preference Alignment of LLMs via Optimal Transport
–Neural Information Processing Systems
Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for distributional preference alignment of LLMs.
Neural Information Processing Systems
Dec-27-2025, 04:13:12 GMT