Think While You Generate: Discrete Diffusion with Planned Denoising
Liu, Sulin; Nam, Juno; Campbell, Andrew; Stärk, Hannes; Xu, Yilun; Jaakkola, Tommi; Gómez-Bombarelli, Rafael
Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that separates the generation process into two models: a planner and a denoiser. At inference time, the planner selects which positions to denoise next by identifying the most corrupted positions, including both positions that were corrupted initially and positions that require additional refinement. This plan-and-denoise approach enables more efficient reconstruction during generation by iteratively identifying and denoising corruptions in the optimal order. DDPD outperforms traditional denoiser-only mask diffusion methods, achieving superior results on the language modeling benchmarks text8 and OpenWebText and on token-based image generation on ImageNet $256 \times 256$. Notably, in language modeling, DDPD significantly reduces the performance gap between diffusion-based and autoregressive methods in terms of generative perplexity. Code is available at https://github.com/liusulin/DDPD.
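The plan-and-denoise loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). The names `plan_and_denoise`, `toy_planner`, and `toy_denoiser` are hypothetical, the planner and denoiser here are oracle stand-ins for trained networks, and always picking the single highest-scoring position is a simplifying assumption about how positions are selected.

```python
from typing import Callable, List, Sequence

# Hypothetical interfaces: a planner scores how corrupted each position looks,
# and a denoiser proposes a new token for one chosen position.
Planner = Callable[[Sequence[str]], List[float]]
Denoiser = Callable[[Sequence[str], int], str]


def plan_and_denoise(
    tokens: Sequence[str],
    planner: Planner,
    denoiser: Denoiser,
    max_steps: int = 100,
    threshold: float = 0.5,
) -> List[str]:
    """Repeatedly denoise the position the planner rates as most corrupted."""
    seq = list(tokens)
    if not seq:
        return seq
    for _ in range(max_steps):
        scores = planner(seq)  # one corruption score per position
        pos = max(range(len(seq)), key=lambda i: scores[i])
        if scores[pos] < threshold:
            break  # the planner considers every position clean
        seq[pos] = denoiser(seq, pos)
    return seq


# Toy stand-ins (assumptions for illustration only): both consult a known
# clean sequence, which a real planner and denoiser would not have access to.
TARGET = list("hello world")


def toy_planner(seq: Sequence[str]) -> List[float]:
    return [0.0 if tok == ref else 1.0 for tok, ref in zip(seq, TARGET)]


def toy_denoiser(seq: Sequence[str], pos: int) -> str:
    return TARGET[pos]


noisy = list("hxllo wqrld")
result = plan_and_denoise(noisy, toy_planner, toy_denoiser)
print("".join(result))  # prints "hello world"
```

Because the planner is queried again after every update, a position that was denoised badly can be selected again later, which corresponds to the "additional refinement" case mentioned in the abstract.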
Oct-8-2024
- Country:
- Asia (1.00)
- Europe > United Kingdom
- England (0.14)
- North America > United States
- Kentucky (0.14)
- Massachusetts (0.14)
- Genre:
- Research Report
- Experimental Study (0.46)
- New Finding (0.46)
- Industry:
- Government
- Foreign Policy (0.93)
- Military (0.67)
- Regional Government
- Voting & Elections (1.00)
- Health & Medicine (1.00)
- Information Technology (1.00)
- Law (1.00)
- Law Enforcement & Public Safety (0.67)
- Leisure & Entertainment (1.00)
- Media > Film (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language > Chatbot (0.86)
- Representation & Reasoning (1.00)
- Vision (0.93)
- Communications (1.00)