IPO: Your Language Model is Secretly a Preference Classifier

Open in new window