Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding