Lightweight Vision Transformer with Bidirectional Interaction

Neural Information Processing Systems 

Recent advancements in vision backbones have significantly improved their performance by simultaneously modeling images' local and global contexts.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found