EarlyConvolutionsHelpTransformersSeeBetter
–Neural Information Processing Systems
This large-kernel plus large-stride convolution runs counter to typical design choices of convolutional layers in neural networks.
Neural Information Processing Systems
Feb-12-2026, 02:07:04 GMT
- Technology: