Distilling Reasoning Capabilities into Smaller Language Models