A Detailed Factor Analysis for the Political Compass Test: Navigating Ideologies of Large Language Models

Kamal, Sadia, Prakash, Lalu Prasad Yadav, Rafiuddin, S M, Rakib, Mohammed, Sen, Atriya, Choudhury, Sagnik Ray

Nov-13-2025–arXiv.org Artificial Intelligence

The Political Compass Test (PCT) and similar surveys are commonly used to assess political bias in auto-regressive LLMs. Our rigorous statistical experiments show that while changes to standard generation parameters have minimal effect on PCT scores, prompt phrasing and fine-tuning individually and together can significantly influence results. Interestingly, fine-tuning on politically rich vs. neutral datasets does not lead to different shifts in scores. We also generalize these findings to a similar popular test called 8 Values. Humans do not change their responses to questions when prompted differently (``answer this question'' vs ``state your opinion''), or after exposure to politically neutral text, such as mathematical formulae. But the fact that the models do so raises concerns about the validity of these tests for measuring model bias, and paves the way for deeper exploration into how political and social views are encoded in LLMs.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Nov-13-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (1.00)

Genre:
- Questionnaire & Opinion Survey (0.93)
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.97)

Industry:
- Health & Medicine (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Law > Statutes (0.46)
- Government
  - Immigration & Customs (0.67)
  - Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.31)