LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

Open in new window