Incorporating BERT into Parallel Sequence Decoding with Adapters
