Adaptive Preference Scaling for Reinforcement Learning with Human Feedback