Reinforcement Learning from Human Feedback: Whose Culture, Whose Values, Whose Perspectives?

Barman, Kristian González, Lohse, Simon, de Regt, Henk

arXiv.org Artificial Intelligence 

This approach is particularly useful when designing AI systems for tasks where it is difficult to specify a precise reward function, or when it is important to align the model's behaviour with certain human expectations and values. For instance, RLHF has notably improved language models for context-aware text generation (Ziegler et al. 2020) and taught robots to navigate cluttered environments (Henry et al. 2010). RLHF is commonly employed in the later stages of fine-tuning, particularly in the development of prominent Large Language Models (LLMs) such as GPT-3.5 or GPT-4. Initially, these models are trained on vast text corpora to grasp a broad range of language patterns and contexts. This foundational training is supplemented by task-specific fine-tuning, in which the models are adjusted to excel in particular applications, such as understanding and generating dialogues. The refinement process is then further enhanced through RLHF.
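The core of the RLHF stage described above is learning a reward function from human preference judgements rather than specifying one by hand. A minimal sketch of that preference-learning step is given below, assuming a toy linear reward model fitted to hypothetical pairwise comparisons with the Bradley-Terry logistic loss; the feature vectors, function names, and data are illustrative assumptions, not taken from any real RLHF system.

```python
import math

def reward(w, x):
    # Toy linear reward model: r(x) = w . x (hypothetical featurisation).
    return sum(wi * xi for wi, xi in zip(w, x))

def train_reward_model(prefs, dim, lr=0.1, epochs=200):
    """Fit reward weights to pairwise human preferences.

    prefs: list of (preferred_features, rejected_features) pairs,
    standing in for annotator comparisons of two model outputs.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        for x_pos, x_neg in prefs:
            # Bradley-Terry model: P(preferred) = sigmoid(r(x_pos) - r(x_neg))
            diff = reward(w, x_pos) - reward(w, x_neg)
            p = 1.0 / (1.0 + math.exp(-diff))
            # Gradient of the negative log-likelihood -log p w.r.t. w.
            g = p - 1.0
            for i in range(dim):
                w[i] -= lr * g * (x_pos[i] - x_neg[i])
    return w

# Hypothetical data: annotators prefer outputs scoring high on feature 0.
prefs = [([1.0, 0.2], [0.1, 0.9]),
         ([0.8, 0.5], [0.2, 0.4])]
w = train_reward_model(prefs, dim=2)
```

In a full RLHF pipeline this learned reward would then drive a policy-optimisation step (e.g. PPO) that adjusts the language model itself; the sketch only covers the reward-modelling stage.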
