Reviews: Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation

Oct-7-2024, 10:12:10 GMT–Neural Information Processing Systems

Original Review: This work builds directly off of Transformer networks. They make two contributions to that kind of architecture. The first is to suggest running the encoder and decoder stacks layer by layer instead of running the encoder stack and passing information to the decoder stack. The second is to actually tie the weights of the encoder and decoder. Running a decoder layer right after its corresponding encoder layer processes (rather than running the next encoder layer) is also an interesting augmentation to Transformer networks.

encoder and decoder, layer-wise coordination, neural machine translation, (5 more...)

Neural Information Processing Systems

Oct-7-2024, 10:12:10 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)