OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation

Neural Information Processing Systems 

The tokenizer, which serves as a translator mapping intricate visual data into a compact latent space, lies at the core of visual generative models.
