SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Open in new window