Training-Free Loosely Speculative Decoding: Accepting Semantically Correct Drafts Beyond Exact Match

Open in new window