Rating-based Reinforcement Learning

White, Devin; Wu, Mingkang; Novoseller, Ellen; Lawhern, Vernon; Waytowich, Nick; Cao, Yongcan

Jul-30-2023–arXiv.org Artificial Intelligence

This paper develops a novel rating-based reinforcement learning approach that uses human ratings as the source of human guidance in reinforcement learning. Unlike existing preference-based and ranking-based reinforcement learning paradigms, which rely on human relative preferences over sample pairs, the proposed approach relies on human evaluations of individual trajectories and requires no relative comparisons between sample pairs. It builds on a new prediction model for human ratings and a novel multi-class loss function. We conduct several experimental studies with synthetic ratings and real human ratings to evaluate the effectiveness and benefits of the approach.

participant, rating class, rbrl, (15 more...)

Country:
- North America > United States
  - Texas > Bexar County
    - San Antonio (0.14)
  - Maryland > Prince George's County
    - Adelphi (0.04)
  - Hawaii > Honolulu County
    - Honolulu (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
