HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding
