URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models