A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
