Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
