ROCM: RLHF on consistency models