Deep Bayesian Active Learning for Preference Modeling in Large Language Models

May-27-2025, 18:27:19 GMT–Neural Information Processing Systems

Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at large scale. Hence, selecting the most informative points for acquiring human feedback may considerably reduce the cost of preference labeling and unleash the further development of LLMs. Bayesian Active Learning provides a principled framework for addressing this challenge and has demonstrated remarkable success in diverse settings. However, previous attempts to employ it for Preference Modeling did not meet such expectations.

deep bayesian active learning, language model, preference modeling, (2 more...)

Neural Information Processing Systems

May-27-2025, 18:27:19 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Cognitive Science > Problem Solving (0.83)
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (0.65)