Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model

Open in new window