From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader
Xu, Weiwen, Li, Xin, Zhang, Wenxuan, Zhou, Meng, Lam, Wai, Si, Luo, Bing, Lidong
–arXiv.org Artificial Intelligence
We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data. PMR can resolve the discrepancy between model pre-training and downstream fine-tuning of existing MLMs. To build the proposed PMR, we constructed a large volume of general-purpose and high-quality MRC-style training data by using Wikipedia hyperlinks and designed a Wiki Anchor Extraction task to guide the MRC-style pre-training. Apart from its simplicity, PMR effectively solves extraction tasks, such as Extractive Question Answering and Named Entity Recognition. PMR shows tremendous improvements over existing approaches, especially in low-resource scenarios. When applied to the sequence classification task in the MRC formulation, PMR enables the extraction of high-quality rationales to explain the classification process, thereby providing greater prediction explainability. PMR also has the potential to serve as a unified model for tackling various extraction and classification tasks in the MRC formulation.
arXiv.org Artificial Intelligence
Oct-16-2023
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Colorado (0.04)
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe > Italy
- Asia
- Middle East > Syria (0.05)
- Japan (0.05)
- China > Hong Kong (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (1.00)
- Leisure & Entertainment > Sports
- Football (1.00)
- Technology: