Don't Look Twice: Faster Video Transformers with Run-Length Tokenization

Rohan Choudhury

Neural Information Processing Systems 

RLT efficiently finds and removes 'runs' of patches that are repeated over time prior to model inference, then replaces each run with a single patch and a positional encoding that represents the resulting token's new length.
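The idea of collapsing temporally repeated patches into single tokens can be sketched as follows. This is a minimal illustration, not the paper's implementation: the comparison metric (mean absolute difference), the threshold `tau`, and the function name are all assumptions made for clarity.

```python
# Hypothetical sketch of run-length tokenization: collapse temporal
# runs of (near-)identical patches into one token plus a run length.
import numpy as np

def run_length_tokenize(patches: np.ndarray, tau: float = 0.1):
    """patches: (T, N, D) array of T frames with N patches of dim D each.

    A patch is dropped when it is nearly identical to the patch at the
    same spatial location in the previous frame; the surviving patch at
    the start of the run records how many frames the run covers.
    """
    T, N, D = patches.shape
    kept, lengths = [], []
    run_idx = np.full(N, -1, dtype=int)  # index into `kept` of each location's active run
    for t in range(T):
        for n in range(N):
            same = t > 0 and np.abs(patches[t, n] - patches[t - 1, n]).mean() < tau
            if same:
                lengths[run_idx[n]] += 1   # extend the existing run
            else:
                run_idx[n] = len(kept)     # start a new run at this patch
                kept.append(patches[t, n])
                lengths.append(1)
    return np.stack(kept), np.array(lengths)
```

On a perfectly static clip every spatial location collapses to one token whose length equals the number of frames, so the token count drops from `T * N` to `N`; the run lengths would then feed the length-aware positional encoding described above.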
