Fast Structured Decoding for Sequence Models
Sun, Zhiqing, Li, Zhuohan, Wang, Haoqing, He, Di, Lin, Zi, Deng, Zhihong
–Neural Information Processing Systems
Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently, non-autoregressive sequence models were proposed to speed up the inference time. However, these models assume that the decoding process of each token is conditionally independent of others. Such a generation process sometimes makes the output sentence inconsistent, and thus the learned non-autoregressive models could only achieve inferior accuracy compared to their autoregressive counterparts.
Neural Information Processing Systems
Mar-18-2020, 21:33:23 GMT
- Technology: