Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

Open in new window