Bench-2-CoP: Can We Trust Benchmarking for EU AI Compliance?