Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference
