Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning