Dynamic Attention-Guided Diffusion for Image Super-Resolution
Moser, Brian B., Frolov, Stanislav, Raue, Federico, Palacio, Sebastian, Dengel, Andreas
–arXiv.org Artificial Intelligence
Diffusion models in image Super-Resolution (SR) treat all image regions with uniform intensity, which risks compromising the overall image quality. To address this, we introduce "You Only Diffuse Areas" (YODA), a dynamic attention-guided diffusion method for image SR. YODA selectively focuses on spatial regions using attention maps derived from the low-resolution image and the current time step in the diffusion process. This time-dependent targeting enables a more efficient conversion to high-resolution outputs by focusing on areas that benefit the most from the iterative refinement process, i.e., detail-rich objects. We empirically validate YODA by extending leading diffusion-based methods SR3 and SRDiff. Our experiments demonstrate new state-of-the-art performance in face and general SR across PSNR, SSIM, and LPIPS metrics. A notable finding is YODA's stabilization effect by reducing color shifts, especially when training with small batch sizes.
arXiv.org Artificial Intelligence
Mar-7-2024
- Country:
- Europe > Germany > Rhineland-Palatinate
- Kaiserslautern (0.04)
- Landau (0.04)
- Europe > Germany > Rhineland-Palatinate
- Genre:
- Research Report (0.82)
- Technology: