SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading