AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
