Deep Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions
Kang, Minwoo, Moon, Suhong, Lee, Seung Hyeong, Raj, Ayush, Suh, Joseph, Chan, David M., Canny, John
–arXiv.org Artificial Intelligence
Large language models (LLMs) are increasingly capable of simulating human behavior, offering cost-effective ways to estimate user responses to various surveys and polls. However, the questions in these surveys usually reflect socially understood attitudes: the patterns of attitudes of old/young, liberal/conservative, as understood by both members and non-members of those groups. It is not clear whether the LLM binding is \emph{deep}, meaning the LLM answers as a member of a particular in-group would, or \emph{shallow}, meaning the LLM responds as an out-group member believes an in-group member would. To explore this difference, we use questions that expose known in-group/out-group biases. This level of fidelity is critical for applying LLMs to various political science studies, including timely topics on polarization dynamics, inter-group conflict, and democratic backsliding. To this end, we propose a novel methodology for constructing virtual personas with synthetic user "backstories" generated as extended, multi-turn interview transcripts. This approach is justified by the theory of \emph{narrative identity} which argues that personality at the highest level is \emph{constructed} from self-narratives. Our generated backstories are longer, rich in detail, and consistent in authentically describing a singular individual, compared to previous methods. We show that virtual personas conditioned on our backstories closely replicate human response distributions (up to an 87% improvement as measured by Wasserstein Distance) and produce effect sizes that closely match those observed in the original studies of in-group/out-group biases. Altogether, our work extends the applicability of LLMs beyond estimating socially understood responses, enabling their use in a broader range of human studies.
arXiv.org Artificial Intelligence
Sep-3-2025
- Country:
- Asia
- India (0.04)
- Middle East > Jordan (0.04)
- North America > United States
- New York > New York County
- New York City (0.04)
- California
- Alameda County > Berkeley (0.04)
- Los Angeles County > Los Angeles (0.04)
- Santa Clara County > Palo Alto (0.04)
- New Jersey (0.04)
- New Mexico (0.04)
- North Dakota (0.04)
- Michigan (0.04)
- Illinois > Cook County
- Chicago (0.04)
- North Carolina (0.04)
- Oklahoma > Oklahoma County
- Oklahoma City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Utah (0.14)
- Alaska (0.04)
- New York > New York County
- Oceania > New Zealand (0.04)
- Asia
- Genre:
- Personal > Interview (0.88)
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Education > Educational Setting (0.93)
- Government > Regional Government
- Health & Medicine
- Consumer Health (1.00)
- Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Leisure & Entertainment > Sports (0.92)
- Media > News (0.92)
- Technology: