One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution

Neural Information Processing Systems 

Reproducing rich spatial details while maintaining temporal consistency is a challenging problem in real-world video super-resolution (Real-VSR), especially when pre-trained generative models such as Stable Diffusion (SD) are leveraged for realistic detail synthesis.