ADPO: Anchored Direct Preference Optimization

Open in new window