VPGTrans: Transfer Visual Prompt Generator across LLMs

Dec-24-2025, 20:52:26 GMT–Neural Information Processing Systems

Since developing a new multimodal LLM (MLLM) by pre-training on tremendous image-text pairs from scratch can be exceedingly resource-consuming, connecting an existing LLM with a comparatively lightweight visual prompt generator (VPG) becomes a feasible paradigm.

name change, transfer visual prompt generator, vpgtran, (8 more...)

Neural Information Processing Systems

Dec-24-2025, 20:52:26 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.61)
  - Machine Learning (0.59)