TemMed-Bench: Evaluating Temporal Medical Image Reasoning in Vision-Language Models