rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection
