SePPO: Semi-Policy Preference Optimization for Diffusion Alignment