Fast Structured Decoding for Sequence Models

Oct-10-2024, 07:37:45 GMT–Neural Information Processing Systems

Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently, non-autoregressive sequence models were proposed to speed up the inference time. However, these models assume that the decoding process of each token is conditionally independent of others. Such a generation process sometimes makes the output sentence inconsistent, and thus the learned non-autoregressive models could only achieve inferior accuracy compared to their autoregressive counterparts.

fast structured decoding, non-autoregressive model, non-autoregressive sequence model, (2 more...)

Neural Information Processing Systems

Oct-10-2024, 07:37:45 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)