What Is Preference Optimization Doing, How and Why?

Open in new window