Boosting Deductive Reasoning with Step Signals In RLHF