SpecTr: Fast Speculative Decoding via Optimal Transport

Neural Information Processing Systems 

However, autoregressive sampling generates tokens one at a time making it slow, and even prohibitive in certain tasks.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found