RRHF (1)

Feb-8-2026, 23:56:44 GMT–Neural Information Processing Systems

RRHF can align with not only human preferences but also any preferences. As a large language model, Wombat has the possibility to generate unsafe responses. We also conduct experiments on the IMDB dataset for assessing positive movie reviews generation. The task expects the model to give positive and fluent movie review completions based on given partial review input texts. RRHF-OP-128 follows the bottommost workflow in Figure 2 in the main texts.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Feb-8-2026, 23:56:44 GMT

Conferences PDF

Add feedback

Country:
- Oceania
  - New Zealand (0.05)
  - Australia > Tasmania (0.05)

Industry:
- Media > Film (0.56)
- Leisure & Entertainment (0.56)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Large Language Model (0.53)

Duplicate Docs Excel Report

Title
RRHF (1)

Similar Docs Excel Report more

Title	Similarity	Source
None found