SGLang: Efficient Execution of Structured Language Model Programs
Neural Information Processing Systems
Large language models (LLMs) are increasingly used for complex tasks that require multiple generation calls, advanced prompting techniques, control flow, and structured inputs/outputs. However, efficient systems are lacking for programming and executing these applications. We introduce SGLang, a system for efficient execution of complex language model programs.
Dec-26-2025, 08:07:15 GMT
- Technology: