Who Evaluates the Evaluators? On Automatic Metrics for Assessing AI-based Offensive Code Generators
