Benchmarking Gender and Political Bias in Large Language Models