CLaSp: In-Context Layer Skip for Self-Speculative Decoding
