Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization