Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement
