DELTA: Dynamic Layer-Aware Token Attention for Efficient Long-Context Reasoning
