Evaluating Vision-Language Models in the Wild with Human Preferences
–Neural Information Processing Systems
Our comprehensive analysis of 20K real-world interactions reveals important insights into the failure cases of top-performing VLMs.
Neural Information Processing Systems
Oct-10-2025, 03:05:35 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Ireland > Leinster
- North America
- Canada > Ontario (0.04)
- Dominican Republic (0.04)
- United States
- California > Santa Barbara County
- Santa Barbara (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Texas (0.04)
- California > Santa Barbara County
- South America
- Chile (0.04)
- Colombia > Meta Department
- Villavicencio (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.67)
- Industry:
- Information Technology (0.67)
- Leisure & Entertainment > Games
- Chess (0.47)
- Computer Games (0.67)
- Media > Film (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (1.00)
- Large Language Model (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Information Technology > Artificial Intelligence