Plug-and-Play Training Framework for Preference Optimization

Open in new window