An Efficient Transformer Decoder with Compressed Sub-layers