Nearest Neighbor Speculative Decoding for LLM Generation and Attribution Minghan Li1
–Neural Information Processing Systems
Large language models (LLMs) often hallucinate and lack the ability to provide attribution for their generations. Semi-parametric LMs, such as kNN-LM, approach these limitations by refining the output of an LM for a given prompt using its nearest neighbor matches in a non-parametric data store. However, these models often exhibit slow inference speeds and produce non-fluent texts.
Neural Information Processing Systems
Mar-25-2025, 04:13:56 GMT
- Country:
- Asia > Middle East
- UAE (0.14)
- Europe (1.00)
- North America
- Mexico > Mexico City (0.14)
- United States (1.00)
- Asia > Middle East
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Leisure & Entertainment (0.92)
- Media > Music (0.45)
- Technology: