AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
William Fleshman, Aleem Khan, Marc Marone, Benjamin Van Durme
arXiv.org Artificial Intelligence
Large language models (LLMs) are increasingly capable of completing knowledge-intensive tasks by recalling information from a static pretraining corpus. Here we are concerned with LLMs in the context of evolving data requirements. For instance: batches of new data that are introduced periodically; subsets of data with user-based access controls; or requirements on dynamic removal of documents with guarantees that associated knowledge cannot be recalled. We wish to satisfy these requirements while at the same time ensuring a model does not forget old information when new data becomes available. To address these issues, we introduce AdapterSwap, a training and inference scheme that organizes knowledge from a data collection into a set of low-rank adapters, which are dynamically composed during inference. Our experiments demonstrate AdapterSwap's ability to support efficient continual learning, while also enabling organizations to have fine-grained control over data access and deletion.
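To make the abstract's mechanism concrete, below is a minimal PyTorch sketch of a LoRA-style linear layer with swappable adapters keyed by data partition. The class name `SwappableLoRALinear`, the `group_id` keys, and the composition rule (summing scaled adapter outputs over the groups a user may access) are illustrative assumptions, not the paper's released implementation; the sketch only shows why deleting an adapter deletes the sole parameters trained on that partition's documents.

```python
import torch
import torch.nn as nn


class SwappableLoRALinear(nn.Module):
    """A frozen base linear layer plus a registry of low-rank adapters.

    Each adapter is keyed by a data-partition / access-group id
    (hypothetical `group_id`), so removing a partition's knowledge
    amounts to deleting its adapter. Illustrative sketch only.
    """

    def __init__(self, in_features: int, out_features: int,
                 rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # base weights stay frozen
        self.base.bias.requires_grad_(False)
        self.rank, self.alpha = rank, alpha
        self.adapters = nn.ModuleDict()  # group_id -> low-rank A/B pair

    def add_adapter(self, group_id: str) -> None:
        a = nn.Linear(self.base.in_features, self.rank, bias=False)
        b = nn.Linear(self.rank, self.base.out_features, bias=False)
        nn.init.zeros_(b.weight)  # adapter starts as a no-op
        self.adapters[group_id] = nn.Sequential(a, b)

    def delete_adapter(self, group_id: str) -> None:
        # The adapter holds the only parameters trained on this
        # partition's documents, so deletion removes that knowledge.
        del self.adapters[group_id]

    def forward(self, x: torch.Tensor, active_groups=()) -> torch.Tensor:
        y = self.base(x)
        # Compose only the adapters the current user is allowed to access.
        for gid in active_groups:
            if gid in self.adapters:
                y = y + (self.alpha / self.rank) * self.adapters[gid](x)
        return y


# Usage: train an adapter per access group, then compose at inference.
layer = SwappableLoRALinear(16, 16)
layer.add_adapter("hr_docs")          # hypothetical partition name
x = torch.randn(2, 16)
out = layer(x, active_groups=["hr_docs"])
layer.delete_adapter("hr_docs")       # partition removal = adapter removal
```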
Apr-12-2024
- Country:
  - Asia > Middle East > UAE (0.14)
  - Europe (0.94)
  - North America > United States (0.68)
- Genre:
  - Research Report (1.00)
- Industry:
  - Government > Regional Government (1.00)
  - Information Technology > Security & Privacy (1.00)
  - Law (1.00)