Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State Automata
Jinghong Chen, Weizhe Lin, Jingbiao Mei, Bill Byrne
arXiv.org Artificial Intelligence
The Directed Acyclic Transformer is a fast non-autoregressive (NAR) model that performs well in Neural Machine Translation. Two issues prevent its application to general Natural Language Generation (NLG) tasks: frequent Out-Of-Vocabulary (OOV) errors and the inability to faithfully generate entity names. We introduce Control-DAG, a constrained decoding algorithm for our Directed Acyclic T5 (DA-T5) model which offers lexical, vocabulary and length control. We show that Control-DAG significantly enhances DA-T5 on the Schema Guided Dialogue and the DART datasets, establishing strong NAR results for Task-Oriented Dialogue and Data-to-Text NLG.
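To give a feel for what length control over a DAG decoder can look like, here is a minimal sketch of a Viterbi-style search for the highest-scoring path that visits an exact number of vertices. It is an illustration under stated assumptions, not the Control-DAG implementation: the function name `best_path_with_length`, the edge-dictionary representation, and the toy probabilities are all hypothetical, and the lexical and vocabulary constraints described in the abstract are not modeled.

```python
import math


def best_path_with_length(edges, num_vertices, target_len):
    """Best path from vertex 0 to the last vertex visiting exactly target_len vertices.

    edges maps (u, v) with u < v to a transition log-probability (hypothetical format).
    Returns (path, log_prob), or (None, -inf) if no path of that length exists.
    """
    neg_inf = -math.inf
    if target_len < 1:
        return None, neg_inf

    # score[k][v]: best log-prob of a path from vertex 0 to v that visits k + 1 vertices.
    score = [[neg_inf] * num_vertices for _ in range(target_len)]
    back = [[None] * num_vertices for _ in range(target_len)]
    score[0][0] = 0.0

    for k in range(1, target_len):
        for (u, v), logp in edges.items():
            cand = score[k - 1][u] + logp
            if cand > score[k][v]:
                score[k][v] = cand
                back[k][v] = u

    last = num_vertices - 1
    if score[target_len - 1][last] == neg_inf:
        return None, neg_inf

    # Follow back-pointers from the final vertex to recover the path.
    path = [last]
    for k in range(target_len - 1, 0, -1):
        path.append(back[k][path[-1]])
    path.reverse()
    return path, score[target_len - 1][last]


# Toy graph with four vertices; the probabilities are made up for illustration.
toy_edges = {
    (0, 1): math.log(0.6),
    (0, 2): math.log(0.4),
    (1, 2): math.log(0.5),
    (1, 3): math.log(0.5),
    (2, 3): math.log(1.0),
}
path3, score3 = best_path_with_length(toy_edges, 4, 3)
path4, score4 = best_path_with_length(toy_edges, 4, 4)
```

In this toy graph, asking for three vertices selects a different path than asking for four, which is the basic effect a length constraint has on decoding: the search is restricted to paths of the requested length instead of taking the globally most probable one.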
Apr-10-2024
- Country:
- Asia > Middle East
- UAE (0.14)
- North America > United States
- California (0.15)
- Louisiana (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Genre:
- Research Report (0.64)