Masked Diffusion Models as Energy Minimization
–Neural Information Processing Systems
We present a systematic theoretical framework that interprets masked diffusion models (MDMs) as solutions to energy minimization problems in discrete optimal transport. Specifically, we prove that three distinct energy formulationskinetic, conditional kinetic, and geodesic energyare mathematically equivalent under the structure of MDMs, and that MDMs minimize all three when the mask schedule satisfies a closed-form optimality condition. This unification not only clarifies the theoretical foundations of MDMs, but also motivates practical improvements in sampling. By parameterizing interpolation schedules via Beta distributions, we reduce the schedule design space to a tractable 2D search, enabling efficient post-training tuning without model modification. Experiments on synthetic and real-world benchmarks demonstrate that our energy-inspired schedules outperform hand-crafted baselines, particularly in low-step sampling settings.
Neural Information Processing Systems
Jun-16-2026, 22:27:57 GMT
- Country:
- Europe > Austria (0.28)
- North America
- United States (0.28)
- Canada (0.28)
- Genre:
- Research Report (0.47)
- Technology: