ThinK: Thinner Key Cache by Query-Driven Pruning

Open in new window