SWEPO: Simultaneous Weighted Preference Optimization for Group Contrastive Alignment

Open in new window