Evaluating Precise Geolocation Inference Capabilities of Vision Language Models