Beyond Semantic Similarity: Reducing Unnecessary API Calls via Behavior-Aligned Retriever