Token Sequence Compression for Efficient Multimodal Computing