Seesaw: High-throughput LLM Inference via Model Re-sharding

Open in new window