RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization

Open in new window