MPNet: Masked and Permuted Pre-training for Language Understanding

Neural Information Processing Systems 

However, XLNet does not leverage the full position information of a sentence and thus suffers from position discrepancy between pre-training and fine-tuning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found