ConvBERT: Improving BERT with Span-based Dynamic Convolution

Open in new window