The Dark Side of Rich Rewards: Understanding and Mitigating Noise in VLM Rewards