Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models