A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era

Open in new window