VPGTrans: Transfer Visual Prompt Generator across LLMs
–Neural Information Processing Systems
Since developing a new multimodal LLM (MLLM) by pre-training on a tremendous amount of image-text pairs from scratch is exceedingly resource-consuming, connecting an existing LLM with a comparatively lightweight visual prompt generator (VPG) becomes a feasible paradigm.
Mar-21-2025, 20:45:50 GMT
- Country:
- Asia > Middle East
- Israel (0.14)
- Europe > Switzerland
- North America > United States
- California (0.14)
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Leisure & Entertainment (0.93)
- Technology: