SpecTr: Fast Speculative Decoding via Optimal Transport

Open in new window