Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

Open in new window