SpecDiff-2: Scaling Diffusion Drafter Alignment For Faster Speculative Decoding
