Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing

Open in new window