Evaluating Precise Geolocation Inference Capabilities of Vision Language Models

Open in new window