Why Masking Diffusion Works: Condition on the Jump Schedule for Improved Discrete Diffusion
