Vision-centric Token Compression in Large Language Model

Open in new window