Gradient-free Decoder Inversion in Latent Diffusion Models

Neural Information Processing Systems 

For example, recent video LDMs can generate more than 16 frames, but GPUs with 24 GB memory can only perform gradient-based decoder inversion for 4 frames.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found