Entropy Controllable Direct Preference Optimization
