Learning Multimodal LLMs without Text-only Forgetting
–Neural Information Processing Systems
LoRRA mirrors the structure of attention but uses low-rank connections for efficiency. Image and text inputs are first aligned with visual learners that operate alongside the main attention, balancing the model's focus on visual elements.
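The idea of an attention-shaped learner built from low-rank connections can be sketched as follows. This is a minimal illustration and not the paper's implementation. The class name, tensor shapes, single-head layout, initialization, and the choice to add the learner's output to the main attention output are all assumptions made for the example.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class LowRankResidualAttention:
    """Hypothetical single-head learner: each d x d projection of ordinary
    attention is replaced by a rank-r factorization (d x r) @ (r x d)."""

    def __init__(self, d_model, rank, seed=0):
        rng = np.random.default_rng(seed)
        scale = 1.0 / np.sqrt(d_model)

        def factor():
            return (rng.normal(0.0, scale, (d_model, rank)),
                    rng.normal(0.0, scale, (rank, d_model)))

        self.d_model = d_model
        self.q, self.k, self.v = factor(), factor(), factor()
        # Assumption (LoRA-style): the second output factor starts at zero,
        # so the learner initially contributes nothing to the block output.
        self.o = (rng.normal(0.0, scale, (d_model, rank)),
                  np.zeros((rank, d_model)))

    @staticmethod
    def _project(x, factors):
        a, b = factors
        return (x @ a) @ b  # costs O(n * d * r) instead of O(n * d * d)

    def __call__(self, hidden, visual):
        # Queries come from the hidden states, keys and values from the
        # visual features, mirroring cross-attention.
        q = self._project(hidden, self.q)
        k = self._project(visual, self.k)
        v = self._project(visual, self.v)
        weights = softmax(q @ k.T / np.sqrt(self.d_model))
        return self._project(weights @ v, self.o)


def block_output(main_attention_out, hidden, visual, learner):
    # The learner runs alongside the main attention; its output is added
    # to the main branch as a residual (an assumption of this sketch).
    return main_attention_out + learner(hidden, visual)


learner = LowRankResidualAttention(d_model=16, rank=2)
hidden = np.ones((5, 16))    # 5 token positions
visual = np.ones((3, 16))    # 3 visual feature vectors
main_out = np.ones((5, 16))  # stand-in for the main attention output
out = block_output(main_out, hidden, visual, learner)
```

With rank 2 and model width 16, each factorized projection holds 2 * 16 * 2 = 64 parameters instead of 256, which is the efficiency argument for low-rank connections in miniature.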
Oct-9-2025, 23:25:57 GMT
- Country:
- Asia > China
- Jiangsu Province > Nanjing (0.04)
- North America > United States
- California > Santa Clara County
- Palo Alto (0.04)
- Missouri > St. Louis County
- St. Louis (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (1.00)
- Large Language Model (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)