TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
