Draft-based Approximate Inference for LLMs