Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models Gen Luo

Oct-8-2025, 18:47:43 GMT–Neural Information Processing Systems

Instead of using large neural networks to connect the image encoder and LLM, MMA adopts lightweight modules, i.e., adapters, to bridge the gap between LLMs and VL tasks, which also enables the joint optimization of the image and language

large language model, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Oct-8-2025, 18:47:43 GMT

Conferences PDF

Add feedback

Country:
- Europe > Romania
  - Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
- Asia > China
  - Fujian Province > Xiamen (0.04)
  - Guangdong Province > Shenzhen (0.04)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.48)

Duplicate Docs Excel Report

Title
5e84e4413268b713f0d4a1b23a9dae57-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found