A Detailed Factor Analysis for the Political Compass Test: Navigating Ideologies of Large Language Models
Kamal, Sadia, Prakash, Lalu Prasad Yadav, Rafiuddin, S M, Rakib, Mohammed, Sen, Atriya, Choudhury, Sagnik Ray
arXiv.org Artificial Intelligence
The Political Compass Test (PCT) and similar surveys are commonly used to assess political bias in auto-regressive LLMs. Our rigorous statistical experiments show that while changes to standard generation parameters have minimal effect on PCT scores, prompt phrasing and fine-tuning, individually and together, can significantly influence results. Interestingly, fine-tuning on politically rich vs. neutral datasets does not lead to different shifts in scores. We also generalize these findings to a similar popular test called 8 Values. Humans do not change their responses to questions when prompted differently ("answer this question" vs. "state your opinion"), or after exposure to politically neutral text, such as mathematical formulae. The fact that the models do change their responses raises concerns about the validity of these tests for measuring model bias, and paves the way for deeper exploration into how political and social views are encoded in LLMs.
Nov-13-2025
- Country:
  - Asia > Thailand
  - North America
    - Canada
      - British Columbia > Vancouver (0.04)
      - Ontario > Toronto (0.04)
    - United States
      - Florida > Miami-Dade County
        - Miami (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Oklahoma > Payne County
        - Stillwater (0.04)
      - Oregon > Multnomah County
        - Portland (0.04)
      - Texas (0.14)
- Genre:
  - Questionnaire & Opinion Survey (0.93)
  - Research Report
    - Experimental Study (0.97)
    - New Finding (1.00)
- Industry:
  - Government
    - Immigration & Customs (0.67)
    - Regional Government (0.46)
  - Health & Medicine (0.93)
  - Law > Statutes (0.46)
  - Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Technology: