A Survey on Human Preference Learning for Large Language Models

Jiang, Ruili, Chen, Kehai, Bai, Xuefeng, He, Zhixuan, Li, Juntao, Yang, Muyun, Zhao, Tiejun, Nie, Liqiang, Zhang, Min

Jun-18-2024–arXiv.org Artificial Intelligence

They lack a systematic review ARGE language models (LLMs) [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] have posed a groundbreaking impact on artificial and discussion on preference learning, especially preference intelligence (AI), transforming the opinions of people on the modeling methods, which are critical to capturing human potential of AI systems for understanding and applying human intentions for LM alignment [13]. To further explore effective languages. These neural network language models with largescale preference learning approaches for better LLM alignment, parameters (mainly over 10 billion) are initially pretrained we present a comprehensive review of human preference on large corpora collected from a wide range of learning methods applicable to language models, examining sources, a remarkable part of which is on the Internet [11]. After LLM alignment methods from the perspective of preference pre-training by imitating how humans use natural languages learning. By analyzing a wide range of alignment approaches in the text data, the foundation LLMs acquire strong and within the preference learning framework, we outline the general language skills [1, 12]. On the other hand, foundation holistic picture of introducing human preference into LLMs, LLMs are observed to have difficulty in understanding or enabling insights to be drawn from every aspect of human responding to diverse human instruction appropriately [13], preference learning for various domains.

conf, human preference, proc, (17 more...)

arXiv.org Artificial Intelligence

Jun-18-2024

arXiv.org PDF

Add feedback

Country:
- Asia > China
  - Heilongjiang Province > Harbin (0.04)
  - Guangdong Province > Shenzhen (0.04)
  - Jiangsu Province (0.04)

Genre:
- Overview (1.00)

Industry:
- Education (0.66)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found