Benchmark Data Contamination of Large Language Models: A Survey