CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Open in new window