Detecting Benchmark Contamination Through Watermarking