An Efficient Transformer Decoder with Compressed Sub-layers

Open in new window