Benchmarking Bias in Large Language Models during Role-Playing

Open in new window