Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking
–Neural Information Processing Systems
Recent masked diffusion models (MDMs) have shown competitive performance compared to autoregressive models (ARMs) for language modeling. While most literature has focused on performance enhancing sampling procedures, efficient sampling from MDMs has been scarcely explored. We make the observation that often a given sequence of partially masked tokens determines the values of multiple unknown tokens deterministically, meaning that a single prediction of a masked model holds additional information unused by standard sampling procedures.
Neural Information Processing Systems
Jun-12-2026, 04:16:43 GMT
- Technology: