Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Neural Information Processing Systems 

Experimental results show that image autoregressive modeling with our tokenizer (DiGIT) benefits both image understanding and image generation with the next token prediction principle, which is inherently straightforward for GPT models but challenging for other generative models.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found