Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
Haoyang Zheng, Xinyang Liu, Cindy Xiangrui Kong, Nan Jiang, Zheyuan Hu, Weijian Luo, Wei Deng, Guang Lin
arXiv.org Artificial Intelligence
Fast, high-quality language generation is a central goal in the age of AI. In this work, we introduce Discrete Diffusion Divergence Instruct (DiDi-Instruct), a training-based method that initializes from a pre-trained (masked) discrete diffusion language model (dLLM) and distills it into a few-step student for fast generation. The resulting DiDi-Instruct model matches or exceeds the performance of its dLLM teacher and the GPT-2 baseline while enabling up to 64$\times$ acceleration. The theoretical foundation of DiDi-Instruct is a novel framework based on integral KL-divergence minimization, which yields a practical training algorithm. We further introduce grouped reward normalization, intermediate-state matching, and a reward-guided ancestral sampler, which significantly improve training stability, model coverage, and inference quality. On OpenWebText, DiDi-Instruct achieves perplexities ranging from 62.2 (8 NFEs) to 18.4 (128 NFEs), outperforming prior accelerated dLLMs and the GPT-2 baseline. These gains come with a negligible entropy loss (around $1\%$) and reduce additional training wall-clock time by more than $20\times$ compared to competing dLLM distillation methods. We further validate the robustness and effectiveness of DiDi-Instruct through extensive ablation studies, model scaling, and the generation of discrete protein sequences. Overall, DiDi-Instruct is an efficient and effective distillation method that enables language generation in the blink of an eye. We will release both code and models at github.com/haoyangzheng-ai/didi-instruct.
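The perplexity figures quoted above are easier to read with the standard definition in mind: perplexity is the exponential of the negative mean log-probability per token, so lower is better. The following is a minimal sketch of that definition only, not the authors' evaluation pipeline. The function name `perplexity` and the sample log-probabilities are hypothetical, and in practice the log-probabilities would come from whichever scoring model the evaluation uses, which the abstract does not specify.

```python
import math


def perplexity(token_log_probs):
    """Return exp(-mean(log p)) for a sequence of natural-log token probabilities."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Hypothetical values, not taken from the paper: eight tokens, each assigned
# probability 0.25 by the scoring model.
uniform_over_four = [math.log(0.25)] * 8
print(perplexity(uniform_over_four))
```

A model that assigns every token probability 0.25 has a perplexity of about 4, meaning it is as uncertain as a uniform choice among four options. Under the same reading, a perplexity of 18.4 corresponds to an effective per-token choice among roughly 18 options.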
Oct-2-2025