Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation
