Large Language Models are Visual Reasoning Coordinators Liangyu Chen,, Bo Li
–Neural Information Processing Systems
Existing methods like ensemble still struggle to aggregate these models with the desired higher-order communications. In this work, we propose Cola, a novel paradigm that coordinates multiple VLMs for visual reasoning.
Neural Information Processing Systems
Oct-9-2025, 09:29:04 GMT
- Country:
- Asia
- Middle East > Jordan (0.04)
- Myanmar > Yangon Region
- Yangon (0.04)
- Singapore (0.04)
- Europe > Slovenia
- Drava > Municipality of Benedikt > Benedikt (0.04)
- North America > United States
- California > Alameda County
- Berkeley (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California > Alameda County
- Asia
- Genre:
- Overview (0.93)
- Research Report
- New Finding (0.67)
- Promising Solution (0.46)
- Industry:
- Education (0.47)
- Technology: