Efficient Inference for Large Language Model-based Generative Recommendation