RRHF (1)

yuanhongyi

Neural Information Processing Systems 

RRHF can align a model with any kind of preference, not only human preferences. As a large language model, Wombat may generate unsafe responses. We also conduct experiments on the IMDB dataset to assess the generation of positive movie reviews. The task expects the model to produce positive and fluent review completions given partial review texts as input. RRHF-OP-128 follows the bottommost workflow in Figure 2 of the main text.
