ToFe: Lagged Token Freezing and Reusing for Efficient Vision Transformer Inference
