Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UI