Chatbots Are Cheating on Their Benchmark Tests