CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification