vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

Open in new window