Draft, Verify, and Improve: Toward Training-Aware Speculative Decoding

Open in new window