SpecExec: MassivelyParallelSpeculativeDecoding forInteractiveLLMInferenceonConsumerDevices

Open in new window