Discovering Preference Optimization Algorithms with and for Large Language Models Chris Lu

Feb-16-2026, 23:50:24 GMT–Neural Information Processing Systems

Typically, preference optimization is approached as an offline supervised learning task using manually crafted convex loss functions. While these methods are based on theoretical insights, they are inherently constrained by human creativity, so the large search space of possible loss functions remains under-explored.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Feb-16-2026, 23:50:24 GMT

Conferences PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - Massachusetts > Hampshire County
    - Amherst (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
- Europe > United Kingdom
  - England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Media (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
9d88b87b31986f8293bb0067a841579e-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found