Preference Learning Algorithms Do Not Learn Preference Rankings

Feb-17-2026, 17:47:23 GMT–Neural Information Processing Systems

Preference learning algorithms (e.g., RLHF and DPO) are frequently used to steer LLMs to produce generations that are more preferred by humans, but our understanding of their inner workings is still limited.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Feb-17-2026, 17:47:23 GMT

Conferences PDF

Add feedback

Country:
- Europe > Austria (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.93)

Industry:
- Media (0.92)
- Information Technology (0.67)
- Leisure & Entertainment (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.93)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
b8ce770a6b25e603fbff4a37f9e31edc-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found