Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective
Yang, Zhuoyi, Guo, Xu, Zhang, Tong, Xu, Huijuan, Li, Boyang
–arXiv.org Artificial Intelligence
With this paper, we survey techniques for improving the predictive accuracy of pretrained large language models by allocating additional compute at inference time. In categorizing test-time scaling methods, we place special emphasis on how a problem is decomposed into subproblems and on the topological organization of these subproblems whether sequential, parallel, or tree-structured. This perspective allows us to unify diverse approaches such as Chain-of-Thought, Branch-Solve-Merge, and Tree-of-Thought under a common lens. We further synthesize existing analyses of these techniques, highlighting their respective strengths and weaknesses, and conclude by outlining promising directions for future research
arXiv.org Artificial Intelligence
Nov-20-2025