Modeling Concentrated Cross-Attention for Neural Machine Translation with Gaussian Mixture Model

Open in new window