STaR: Self-TaughtReasoner BootstrappingReasoningWithReasoning
–Neural Information Processing Systems
For example, [5] demonstrated that LLMs explicitly trained to use "scratchpads" for intermediate steps can attain perfect in-distribution performance on arithmetic, and strong out-of-distribution generalization, while models trained topredict answers directly fail to do either.
Neural Information Processing Systems
Feb-9-2026, 10:34:01 GMT
- Country:
- Europe > France (0.04)
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Genre:
- Research Report (0.46)
- Technology: