Decouple knowledge from parameters for plug-and-play language modeling
Cheng, Xin, Lin, Yankai, Chen, Xiuying, Zhao, Dongyan, Yan, Rui
–arXiv.org Artificial Intelligence
Pre-trained language models(PLM) have made impressive results in various NLP tasks. It has been revealed that one of the key factors to their success is the parameters of these models implicitly learn all kinds of knowledge during pre-training. However, encoding knowledge implicitly in the model parameters has two fundamental drawbacks. First, the knowledge is neither editable nor scalable once the model is trained, which is especially problematic in that knowledge is consistently evolving. Second, it lacks interpretability and prevents humans from understanding which knowledge PLM requires for a certain problem. In this paper, we introduce PlugLM, a pre-training model with differentiable plug-in memory(DPM). The key intuition is to decouple the knowledge storage from model parameters with an editable and scalable key-value memory and leverage knowledge in an explainable manner by knowledge retrieval in the DPM. To justify this design choice, we conduct evaluations in three settings including: (1) domain adaptation. PlugLM obtains 3.95 F1 improvements across four domains on average without any in-domain pre-training. (2) knowledge update. PlugLM could absorb new knowledge in a training-free way after pre-training is done. (3) in-task knowledge learning. PlugLM could be further improved by incorporating training samples into DPM with knowledge prompting.
arXiv.org Artificial Intelligence
Sep-18-2023
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Czechia > Prague (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- Belgium > Brussels-Capital Region
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Dominican Republic (0.04)
- Puerto Rico (0.04)
- United States
- New York > New York County
- New York City (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Virginia (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- District of Columbia (0.04)
- Maryland (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Florida
- Escambia County > Pensacola (0.04)
- Okaloosa County
- Crestview (0.04)
- Fort Walton Beach (0.04)
- Alaska (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- Canada
- Oceania > Australia
- South America > Chile
- Africa > Ethiopia
- Genre:
- Research Report (1.00)
- Industry:
- Technology: