Associating Objects with Transformers for Video Object Segmentation Zongxin Yang 1,2, Y unchao Wei 3,4, Yi Yang 1 1 CCAI, College of Computer Science and Technology, Zhejiang University 2

Neural Information Processing Systems 

Transformer is designed for constructing hierarchical matching and propagation. We conduct extensive experiments on both multi-object and single-object benchmarks to examine AOT variant networks with different complexities.