When, What, and How: Rethinking Retrieval-Enhanced Speculative Decoding
