Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Y ang 1 Ruomeng Ding 2 Yong Lin

Neural Information Processing Systems 

While previous research has advocated for constraining policy optimization, our study introduces a novel approach to enhance the reward model's

Similar Docs  Excel Report  more

TitleSimilaritySource
None found