Point of Order: Action-Aware LLM Persona Modeling for Realistic Civic Simulation
Merrill, Scott, Srivastava, Shashank
–arXiv.org Artificial Intelligence
Large language models offer opportunities to simulate multi-party deliberation, but realistic modeling remains limited by a lack of speaker-attributed data. Transcripts produced via automatic speech recognition (ASR) assign anonymous speaker labels (e.g., Speaker_1), preventing models from capturing consistent human behavior. This work introduces a reproducible pipeline to transform public Zoom recordings into speaker-attributed transcripts with metadata like persona profiles and pragmatic action tags (e.g., [propose_motion]). We release three local government deliberation datasets: Appellate Court hearings, School Board meetings, and Municipal Council sessions. Fine-tuning LLMs to model specific participants using this "action-aware" data produces a 67% reduction in perplexity and nearly doubles classifier-based performance metrics for speaker fidelity and realism. Turing-style human evaluations show our simulations are often indistinguishable from real deliberations, providing a practical and scalable method for complex realistic civic simulations.
arXiv.org Artificial Intelligence
Nov-25-2025
- Country:
- North America > United States
- District of Columbia (0.04)
- North Carolina (0.04)
- Virginia > Albemarle County (0.05)
- Oceania > New Zealand
- North Island > Waikato (0.04)
- North America > United States
- Genre:
- Personal > Interview (1.00)
- Research Report (0.81)
- Industry:
- Education (1.00)
- Government (1.00)
- Law > Litigation (1.00)
- Technology: