Critique-out-Loud Reward Models