Training Language Models to Reason Efficiently

Open in new window