Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
Tang, Zilu, Akyürek, Afra Feyza, Akyürek, Ekin, Wijaya, Derry
–arXiv.org Artificial Intelligence
A prominent issue in aligning language models (LMs) to personalized preferences is underspecification -- the lack of information from users about their preferences. A popular trend of injecting such specification is adding a prefix (e.g. prior relevant conversations) to the current user's conversation to steer preference distribution. Most methods passively model personal preferences with prior example preferences pairs. We ask whether models benefit from actively inferring preference descriptions, and address this question by creating a synthetic personalized alignment dataset based on famous people with known public preferences. We then test how effective finetuned 1-8B size models are at inferring and aligning to personal preferences. Results show that higher-quality active prefixes lead to better generalization, more contextually faithful models, and less systematic biases across different protected attributes. All our results suggest active alignment can lead to a more controllable and efficient path for personalized alignment.
arXiv.org Artificial Intelligence
Sep-30-2025
- Country:
- Africa > Ethiopia (0.04)
- Asia
- Malaysia (0.04)
- Japan (0.04)
- Indonesia (0.04)
- Philippines (0.04)
- China (0.04)
- Middle East > Jordan (0.04)
- Thailand (0.04)
- Singapore (0.04)
- India (0.04)
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England (0.04)
- Ireland > Leinster
- North America > United States
- California (0.04)
- Connecticut (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Iowa (0.04)
- Kansas (0.04)
- Rhode Island (0.04)
- Wisconsin (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Leisure & Entertainment > Sports (1.00)
- Media (1.00)
- Law > Labor & Employment Law (1.00)
- Banking & Finance (1.00)
- Education (1.00)
- Health & Medicine
- Consumer Health (1.00)
- Health Care Providers & Services (0.92)
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (0.93)
- Energy (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government
- Technology: