Efficient Inference for Large Language Model-based Generative Recommendation
