LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models