Preference Optimization by Estimating the Ratio of the Data Distribution
