Statistical Rejection Sampling Improves Preference Optimization
