Effective Learning for Small Reasoning Models: An Empirical Study on 0.5B Reasoning LLMs

Open in new window