Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Neural Information Processing Systems 

Layer Normalization operations, and (iii) incorporates an efficient depthwise down-sampling layer to efficiently sub-sample the input signal.