Self-Training Elicits Concise Reasoning in Large Language Models

Open in new window