Making Small Language Models Efficient Reasoners: Intervention, Supervision, Reinforcement

Open in new window