Adaptive KV-Cache Compression without Manually Setting Budget

Open in new window