VQAScore: Evaluating and improving vision-language generative models