A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Zhong, Jialun, Shen, Wei, Li, Yanzeng, Gao, Songyang, Lu, Hua, Chen, Yicheng, Zhang, Yang, Zhou, Wei, Gu, Jinjie, Zou, Lei

Apr-18-2025–arXiv.org Artificial Intelligence

Reward Model (RM) has demonstrated impressive potential for enhancing Large Language Models (LLM), as RM can serve as a proxy for human preferences, providing signals to guide LLMs' behavior in various tasks. In this paper, we provide a comprehensive overview of relevant research, exploring RMs from the perspectives of preference collection, reward modeling, and usage. Next, we introduce the applications of RMs and discuss the benchmarks for evaluation. Furthermore, we conduct an in-depth analysis of the challenges existing in the field and dive into the potential research directions. This paper is dedicated to providing beginners with a comprehensive introduction to RMs and facilitating future studies. The resources are publicly available at github\footnote{https://github.com/JLZhong23/awesome-reward-models}.

arxiv preprint, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Apr-18-2025

arXiv.org PDF

Add feedback

Country:
- Asia (1.00)
- North America
  - United States (1.00)
  - Canada (0.68)
- Europe > Austria
  - Vienna (0.15)

Genre:
- Overview (1.00)
- Research Report (0.87)

Industry:
- Leisure & Entertainment (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found