SpecTr: Fast Speculative Decoding via Optimal Transport
–Neural Information Processing Systems
However, autoregressive sampling generates tokens one at a time making it slow, and even prohibitive in certain tasks.
Neural Information Processing Systems
Feb-12-2026, 14:06:02 GMT
- Technology: