Preference Optimization as Probabilistic Inference
