Evaluating Vision-Language Models in the Wild with Human Preferences

Neural Information Processing Systems 

Our comprehensive analysis of 20K real-world interactions reveals important insights into the failure cases of top-performing VLMs.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found