Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models