Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning

Open in new window