Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation

Oct-10-2024, 15:00:24 GMT–Neural Information Processing Systems

Non-autoregressive translation (NAT) models are typically trained with the cross-entropy loss, which forces the model outputs to be aligned verbatim with the target sentence and will highly penalize small shifts in word positions. Latent alignment models relax the explicit alignment by marginalizing out all monotonic latent alignments with the CTC loss. However, they cannot handle non-monotonic alignments, which is non-negligible as there is typically global word reordering in machine translation. In this work, we explore non-monotonic latent alignments for NAT. We extend the alignment space to non-monotonic alignments to allow for the global word reordering and further consider all alignments that overlap with the target sentence.

alignment, ctc-based non-autoregressive machine translation, non-monotonic latent alignment, (4 more...)

Neural Information Processing Systems

Oct-10-2024, 15:00:24 GMT

Conferences Web Page

Add feedback

Genre:
- Play > Prospect (0.68)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.79)