Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
