Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Kang, Minki, Han, Moonsu, Hwang, Sung Ju
–arXiv.org Artificial Intelligence
We propose a method to automatically generate a domain- and task-adaptive maskings of the given text for self-supervised pre-training, such that we can effectively adapt the language model to a particular target task (e.g. question answering). Specifically, we present a novel reinforcement learning-based framework which learns the masking policy, such that using the generated masks for further pre-training of the target language model helps improve task performance on unseen texts. We use off-policy actor-critic with entropy regularization and experience replay for reinforcement learning, and propose a Transformer-based policy network that can consider the relative importance of words in a given text. We validate our Neural Mask Generator (NMG) on several question answering and text classification datasets using BERT and DistilBERT as the language models, on which it outperforms rule-based masking strategies, by automatically learning optimal adaptive maskings.
arXiv.org Artificial Intelligence
Oct-6-2020
- Country:
- Oceania > Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America
- United States
- Texas (0.04)
- Nevada (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- New York
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- New York County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > British Columbia
- United States
- Europe
- Belgium (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Hauts-de-France
- Asia
- Middle East > Jordan (0.04)
- South Korea
- China
- Yunnan Province > Kunming (0.04)
- Hong Kong (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (0.46)
- Technology: