CAOTE: KV Cache Selection for LLMs via Attention Output Error-Based Token Eviction