Recall-Extend Dynamics: Enhancing Small Language Models through Controlled Exploration and Refined Offline Integration