Don't Look Twice: Faster Video Transformers with Run-Length Tokenization Rohan Choudhury
–Neural Information Processing Systems
RL T efficiently finds and removes'runs' of patches that are repeated over time prior to model inference, then replaces them with a single patch and a positional encoding to represent the resulting token's new length.
Neural Information Processing Systems
Feb-10-2026, 19:20:44 GMT
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.93)
- Natural Language (0.93)
- Vision (1.00)
- Information Technology > Artificial Intelligence