SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodal LLMs

Jun-14-2026, 07:01:13 GMT–Neural Information Processing Systems

Multimodal Large Language Models (MLLMs) typically process a large number of visual tokens, many of them redundant, which incurs considerable computational overhead. Existing visual token pruning methods primarily select the most salient tokens based on attention scores, which leaves the selected token set semantically incomplete.
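
To make the saliency-only selection described above concrete, here is a minimal sketch of top-k pruning by attention score. It illustrates the baseline strategy the abstract criticizes, not the SCOPE method itself. The function name `prune_by_saliency`, the `keep` parameter, and the example scores are all hypothetical and do not come from the paper.

```python
from typing import List, Sequence


def prune_by_saliency(attention_scores: Sequence[float], keep: int) -> List[int]:
    """Return the indices of the `keep` highest-scoring visual tokens.

    Hypothetical helper: `attention_scores` is assumed to hold one
    saliency value per visual token.
    """
    if keep < 0:
        raise ValueError("keep must be non-negative")
    # Rank token indices by descending score; the sort is stable,
    # so tied tokens keep their original relative order.
    ranked = sorted(range(len(attention_scores)),
                    key=lambda i: -attention_scores[i])
    # Return the survivors in their original token order.
    return sorted(ranked[:keep])


# Made-up scores for six visual tokens.
scores = [0.05, 0.40, 0.38, 0.02, 0.10, 0.05]
print(prune_by_saliency(scores, keep=2))  # prints [1, 2]
```

In this toy example both kept tokens are neighbors, and everything else is discarded regardless of what it depicts. That is one way a purely score-ranked selection can end up covering only part of the image content.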

artificial intelligence, large language model, natural language

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)