BalanceKV: KV Cache Compression through Discrepancy Theory

Open in new window