Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Y ang 1 Ruomeng Ding 2 Yong Lin

Open in new window