Clean Evaluations on Contaminated Visual Language Models