Benchmarking Misuse Mitigation Against Covert Adversaries