VocalBench-DF: A Benchmark for Evaluating Speech LLM Robustness to Disfluency