DropDim: A Regularization Method for Transformer Networks
