Faster Cascades via Speculative Decoding

Open in new window