BPO: Revisiting Preference Modeling in Direct Preference Optimization

Open in new window