Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs 2 Yong Lin

Open in new window