CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

Open in new window