VPGTrans: Transfer Visual Prompt Generator across LLMs
–Neural Information Processing Systems
Since developing a new multimodal LLM (MLLM) by pre-training on tremendous image-text pairs from scratch can be exceedingly resource-consuming, connecting an existing LLM with a comparatively lightweight visual prompt generator (VPG) becomes a feasible paradigm.
Neural Information Processing Systems
Dec-24-2025, 20:52:26 GMT
- Technology: