Object-Level Verbalized Confidence Calibration in Vision-Language Models via Semantic Perturbation

Open in new window