Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Open in new window