Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
Kang, Minki, Hwang, Sung Ju, Lee, Gibbeum, Cho, Jaewoong
–arXiv.org Artificial Intelligence
As Large Language Models (LLMs) are increasingly deployed in specialized domains with continuously evolving knowledge, the need for timely and precise knowledge injection has become essential. Fine-tuning with paraphrased data is a common approach to enhance knowledge injection, yet it faces two significant challenges: high computational costs due to repetitive external model usage and limited sample diversity. To this end, we introduce LaPael, a latent-level paraphrasing method that applies input-dependent noise to early LLM layers. This approach enables diverse and semantically consistent augmentations directly within the model. Furthermore, it eliminates the recurring costs of paraphrase generation for each knowledge update. Our extensive experiments on question-answering benchmarks demonstrate that LaPael improves knowledge injection over standard fine-tuning and existing noise-based approaches. Additionally, combining LaPael with data-level paraphrasing further enhances performance.
arXiv.org Artificial Intelligence
Nov-1-2024
- Country:
- North America
- Dominican Republic (0.04)
- United States
- District of Columbia > Washington (0.04)
- New York (0.04)
- Maryland > Baltimore (0.04)
- Colorado (0.04)
- Texas > Travis County
- Austin (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Canada
- Ontario > Toronto (0.04)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- Europe > Spain
- Asia
- China > Hong Kong (0.04)
- Singapore (0.04)
- Japan (0.04)
- Indonesia > Bali (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Africa
- Sudan
- Khartoum State > Khartoum (0.04)
- Khartoum (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Sudan
- North America
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Media (0.67)
- Leisure & Entertainment > Sports
- Football (1.00)
- Technology: