VPO: Leveraging the Number of Votes in Preference Optimization

Open in new window