Improving BERT with Hybrid Pooling Network and Drop Mask

Open in new window