VHELM: A Holistic Evaluation of Vision Language Models
Tony Lee 1, Haoqin Tu 2, Chi Heem Wong
–Neural Information Processing Systems
Our framework is designed to be lightweight and automatic so that evaluation runs are cheap and fast. Our initial run evaluates 22 VLMs on 21 existing datasets to provide a holistic snapshot of the models. We uncover new key findings, such as the fact that efficiency-focused models (e.g., Claude 3 Haiku or Gemini 1.5 Flash) perform significantly
Oct-10-2025, 22:31:06 GMT
- Country:
  - Asia > Japan (0.04)
  - North America
    - Montserrat (0.04)
    - United States
      - California
        - Santa Clara County > Palo Alto (0.04)
        - Santa Cruz County > Santa Cruz (0.04)
      - North Carolina > Orange County
        - Chapel Hill (0.04)
  - South America > Peru
    - Cusco Department > Cusco Province > Cusco (0.04)
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Education > Educational Setting (0.46)
  - Health & Medicine (1.00)
  - Law (0.67)
- Technology: