ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models

Open in new window