VPGTrans: Transfer Visual Prompt Generator across LLMs

Neural Information Processing Systems 

Developing a new multimodal LLM (MLLM) by pre-training from scratch on a tremendous number of image-text pairs is exceedingly resource-consuming, so connecting an existing LLM to a comparatively lightweight visual prompt generator (VPG) has become a feasible paradigm.
