Geometric-Averaged Preference Optimization for Soft Preference Labels

Open in new window