Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy
