Multi-Path Transformer is Better: A Case Study on Neural Machine Translation

Open in new window