Hatevolution: What Static Benchmarks Don't Tell Us