Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Open in new window