Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning
