LG-VQ: Language-Guided Codebook Learning

Neural Information Processing Systems 

Although existing methods have shown superior performance, most of them learn a single-modal codebook (e.g., image), resulting in suboptimal performance.
