VQAScore: Evaluating and improving vision-language generative models

Open in new window