Let the LLMStick to Its Strengths: Learning to Route Economical LLM
–Neural Information Processing Systems
Recently, test-time scaling of Large Language Models (LLMs) has emerged as a practical alternative to parameter and data scaling. Reasoning tasks often require large-scale, RLVR-based LLMs, while more economical LLMs can handle simpler tasks. Routing an LLM tailored to suitability (i.e., capability and cost) ensures usability and efficiency. We introduce LLMRec, which routes the most suitable LLM to the user query without pre-inference on the candidate LLM zoo.
Neural Information Processing Systems
Jun-16-2026, 01:46:13 GMT
- Country:
- North America > United States (0.93)
- Genre:
- Research Report > Experimental Study (1.00)
- Technology: