ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting
–Neural Information Processing Systems
Diffusion-based image super-resolution (SR) methods are mainly limited by the low inference speed due to the requirements of hundreds or even thousands of sampling steps. Existing acceleration sampling techniques inevitably sacrifice performance to some extent, leading to over-blurry SR results. To address this issue, we propose a novel and efficient diffusion model for SR that significantly reduces the number of diffusion steps, thereby eliminating the need for post-acceleration during inference and its associated performance deterioration. Additionally, an elaborate noise schedule is developed to flexibly control the shifting speed and the noise strength during the diffusion process. Extensive experiments demonstrate that the proposed method obtains superior or at least comparable performance to current state-of-the-art methods on both synthetic and real-world datasets, \textit{\textbf{even only with 20 sampling steps}}.
Neural Information Processing Systems
Oct-10-2024, 18:16:57 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.99)
- Vision (0.65)
- Information Technology > Artificial Intelligence