V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction