Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization
