OWL: Overcoming Window Length-Dependence in Speculative Decoding for Long-Context Inputs

Open in new window