Early Convolutions Help Transformers See Better

Neural Information Processing Systems 

This large-kernel plus large-stride convolution runs counter to typical design choices of convolutional layers in neural networks.
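To make the contrast concrete, here is a minimal sketch of the standard output-size arithmetic for a convolution along one spatial dimension. The helper name `conv_output_size` and the specific numbers (a 224-pixel input, a 16x16 kernel with stride 16, and a 3x3 kernel with stride 2 and padding 1) are illustrative assumptions, not values taken from the excerpt above.

```python
def conv_output_size(input_size: int, kernel: int, stride: int, padding: int = 0) -> int:
    """Output length of a convolution along one dimension (floor convention)."""
    return (input_size + 2 * padding - kernel) // stride + 1


# Assumed, illustrative settings (not stated in the excerpt above).
# Large kernel with a matching large stride: one step collapses the resolution.
patchify = conv_output_size(224, kernel=16, stride=16)

# Small kernel with a small stride: the resolution is only halved per layer.
typical = conv_output_size(224, kernel=3, stride=2, padding=1)

print(patchify, typical)
```

Under these assumptions, the large-kernel, large-stride layer reduces the input by a factor of 16 in a single step, whereas the small-kernel layer would need to be stacked several times to reach the same resolution.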
