VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models