Data-Centric Human Preference Optimization with Rationales