Data-Centric Human Preference Optimization with Rationales

Open in new window