Leveraging Reward Models for Guiding Code Review Comment Generation