Divert More Attention to Vision-Language Tracking

Neural Information Processing Systems 

Relying on Transformer for complex visual feature learning, object tracking has witnessed the new standard for state-of-the-arts (SOT As).