This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language Models
