Splitwiser: Efficient LM inference with constrained resources
