Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs