Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Jun-23-2026, 03:02:40 GMT–Neural Information Processing Systems

Diffusion models have emerged as a powerful class of generative models across various modalities, including image, video, and audio synthesis. However, their deployment is often limited by significant inference latency, primarily due to the inherently sequential nature of the denoising process. While existing parallelization strategies attempt to accelerate inference by distributing computation across multiple devices, they typically incur high communication overhead, hindering deployment on commercial hardware. To address this challenge, we propose ParaStep, a novel parallelization method based on a reuse-then-predict mechanism that parallelizes diffusion inference by exploiting similarity between adjacent denoising steps. Unlike prior approaches that rely on layer-wise or stage-wise communication, ParaStep employs lightweight, step-wise communication, substantially reducing overhead. ParaStep achieves end-to-end speedups of up to 3.88 on SVD, 2.43 on CogVideoX-2b, and 6.56 on AudioLDM2-large, while maintaining generation quality.

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Jun-23-2026, 03:02:40 GMT

Conferences PDF

Add feedback

Country:
- Europe (0.93)
- North America > United States
  - Minnesota (0.28)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Representation & Reasoning (0.93)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found