Let the LLM Stick to Its Strengths: Learning to Route Economical LLM
–Neural Information Processing Systems
Recently, test-time scaling of Large Language Models (LLMs) has emerged as a practical alternative to parameter and data scaling. Reasoning tasks often require large-scale, RLVR-based LLMs, while more economical LLMs can handle simpler tasks. Routing an LLM tailored to (, capability and cost) ensures usability and efficiency. We introduce LLMRec, which routes the most suitable LLM to the user query without pre-inference on the candidate LLM zoo.
Neural Information Processing Systems
Jun-11-2026, 11:22:11 GMT
- Technology: