Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?

Zilu Tang, Afra Feyza Akyürek, Ekin Akyürek, Derry Wijaya

Sep-30-2025, arXiv.org Artificial Intelligence

A prominent issue in aligning language models (LMs) to personalized preferences is underspecification: the lack of information from users about their preferences. A popular way to inject such a specification is to add a prefix (e.g., prior relevant conversations) to the current user's conversation to steer the preference distribution. Most methods passively model personal preferences with prior example preference pairs. We ask whether models benefit from actively inferring preference descriptions, and address this question by creating a synthetic personalized alignment dataset based on famous people with known public preferences. We then test how effective finetuned models in the 1-8B size range are at inferring and aligning to personal preferences. Results show that higher-quality active prefixes lead to better generalization, more contextually faithful models, and fewer systematic biases across different protected attributes. Overall, our results suggest that active alignment can lead to a more controllable and efficient path for personalized alignment.
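To make the contrast between passive and active prefixes concrete, here is a minimal sketch of how each kind of prefix might be prepended to the current conversation. It is not the authors' implementation: the names `PreferencePair`, `passive_prefix`, `active_prefix`, and `build_input`, as well as the prompt wording, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class PreferencePair:
    """One prior preference example (hypothetical structure)."""
    prompt: str
    chosen: str
    rejected: str


def passive_prefix(pairs: list[PreferencePair]) -> str:
    """Passive conditioning: prepend raw prior preference pairs as-is."""
    blocks = [
        f"Prompt: {p.prompt}\nPreferred: {p.chosen}\nDispreferred: {p.rejected}"
        for p in pairs
    ]
    return "\n\n".join(blocks)


def active_prefix(persona_description: str) -> str:
    """Active conditioning: prepend an inferred preference description.

    The description is assumed to come from a separate inference step
    that is not shown here.
    """
    return f"User preference description: {persona_description}"


def build_input(prefix: str, conversation: str) -> str:
    """Place the prefix before the current user's conversation."""
    return f"{prefix}\n\n{conversation}"
```

In this sketch, both prefix types feed the same `build_input` call, so the only difference between the two settings is whether the model sees raw example pairs or a short written description of the user's preferences.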

large language model, machine learning, persona

Country:
- North America > United States (1.00)
- Europe (1.00)
- Asia (1.00)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Energy (1.00)
- Education (1.00)
- Banking & Finance (1.00)
- Law > Labor & Employment Law (1.00)
- Media (1.00)
- Leisure & Entertainment > Sports (1.00)
- Government > Regional Government
  - North America Government > United States Government (1.00)
- Health & Medicine
  - Consumer Health (1.00)
  - Health Care Providers & Services (0.92)
  - Pharmaceuticals & Biotechnology (0.67)
  - Therapeutic Area
    - Immunology (1.00)
    - Infections and Infectious Diseases (0.93)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Communications > Social Media (0.93)
  - Artificial Intelligence
    - Representation & Reasoning > Personal Assistant Systems (0.93)
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)