ConfProBench: A Confidence Evaluation Benchmark for MLLM-Based Process Judges

Open in new window