a13ff984831deea39e6132bafdfdd6d5-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing Systems 

While recent research suggests that current large Vision-Language Models (VLMs) rely more on shape, we find that they remain seriously limited in this regard.
