Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

Neural Information Processing Systems 

In this paper, we propose a novel adaptive preference loss, underpinned by distributionally robust optimization (DRO), designed to address the uncertainty in preference strength.
