Think While You Generate: Discrete Diffusion with Planned Denoising

Liu, Sulin, Nam, Juno, Campbell, Andrew, Stärk, Hannes, Xu, Yilun, Jaakkola, Tommi, Gómez-Bombarelli, Rafael

Oct-8-2024–arXiv.org Machine Learning

Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that separates the generation process into two models: a planner and a denoiser. At inference time, the planner selects which positions to denoise next by identifying the most corrupted positions, including both initially corrupted positions and those requiring additional refinement. This plan-and-denoise approach enables more efficient reconstruction during generation by iteratively identifying and denoising corruptions in the optimal order. DDPD outperforms traditional denoiser-only mask diffusion methods, achieving superior results on the language modeling benchmarks text8 and OpenWebText and on token-based generation on ImageNet $256 \times 256$. Notably, in language modeling, DDPD significantly reduces the performance gap between diffusion-based and autoregressive methods in terms of generative perplexity. Code is available at https://github.com/liusulin/DDPD.
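The plan-and-denoise loop described in the abstract can be illustrated with a minimal toy sketch. This is not the authors' implementation: the names `plan_and_denoise`, `toy_planner`, and `toy_denoiser` are hypothetical, the two "models" are hand-written stand-ins for trained networks, and the greedy pick of the single most corrupted position with a threshold stopping rule is a simplifying assumption rather than the paper's sampler.

```python
from typing import Callable, List, Sequence

MASK = -1    # assumed placeholder id for a masked (initially corrupted) token
VOCAB = 10   # assumed toy vocabulary size


def plan_and_denoise(
    tokens: Sequence[int],
    planner: Callable[[List[int]], List[float]],
    denoiser: Callable[[List[int], int], int],
    max_steps: int = 100,
    threshold: float = 0.5,
) -> List[int]:
    """Repeatedly ask the planner where the corruption is, then denoise there."""
    x = list(tokens)  # work on a copy so the caller's sequence is untouched
    if not x:
        return x
    for _ in range(max_steps):
        noise_probs = planner(x)  # per-position estimate of "this token is noise"
        pos = max(range(len(x)), key=noise_probs.__getitem__)
        if noise_probs[pos] < threshold:
            break  # planner considers every position clean enough
        x[pos] = denoiser(x, pos)  # rewrite only the selected position
    return x


# Toy stand-ins (assumption): "clean" data satisfies x[i] == i % VOCAB.
def toy_planner(x: List[int]) -> List[float]:
    probs = []
    for i, tok in enumerate(x):
        if tok == MASK:
            probs.append(1.0)   # initially corrupted position
        elif tok != i % VOCAB:
            probs.append(0.9)   # filled in, but still needs refinement
        else:
            probs.append(0.0)   # looks clean
    return probs


def toy_denoiser(x: List[int], pos: int) -> int:
    return pos % VOCAB  # a trained denoiser would predict this from context


noisy = [0, MASK, 7, 3, MASK, 5]
print(plan_and_denoise(noisy, toy_planner, toy_denoiser))  # [0, 1, 2, 3, 4, 5]
```

In this toy run the masked positions are filled first and the wrong token at index 2 is revisited afterwards, which mirrors the abstract's point that the planner covers both initially corrupted positions and those requiring additional refinement.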

denoiser, diffusion, probability

Genre:
- Research Report
  - Experimental Study (0.46)
  - New Finding (0.46)

Technology:
- Information Technology
  - Communications (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Vision (0.93)
    - Natural Language > Chatbot (0.87)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
