Scaling White-Box Transformers for Vision Jinrui Y ang

Neural Information Processing Systems 

The most extensive model described to date is the base model size encompasses 77.6M parameters

Similar Docs  Excel Report  more

TitleSimilaritySource
None found