Deep Bayesian Active Learning for Preference Modeling in Large Language Models

Oct-10-2025, 17:54:55 GMT–Neural Information Processing Systems

We address this by proposing the B ayesian A ctive L earner for P reference M odeling (BAL-PM), a novel stochastic acquisition policy that not only targets points of high epistemic uncertainty according to the preference model but also seeks to maximize the entropy of the acquired prompt distribution in the feature space spanned by the employed LLM.

epistemic uncertainty, neural information processing system, preference modeling, (12 more...)

Neural Information Processing Systems

Oct-10-2025, 17:54:55 GMT

Conferences PDF

Add feedback

Country:
- North America
  - Puerto Rico (0.04)
  - United States
    - California (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - Indonesia > Bali (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Health & Medicine (0.67)
- Energy (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.67)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.83)

Duplicate Docs Excel Report

Title
d5e256c988bdee59a0f4d7a9bc1dd6d9-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found