Dynamic Depth Decoding: Faster Speculative Decoding for LLMs

Open in new window