AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking
Premptis, Iraklis, Lymperaiou, Maria, Filandrianos, Giorgos, Mastromichalakis, Orfeas Menis, Voulodimos, Athanasios, Stamou, Giorgos
The Unlearning Sensitive Content from Large Language Models task aims to remove targeted data points from trained models while minimally affecting their general knowledge. In our work, we leverage parameter-efficient, gradient-based unlearning using low-rank adaptation (LoRA) and layer-focused fine-tuning. To further enhance unlearning effectiveness, we employ data chunking: we split the forget data into disjoint partitions and merge each with cyclically sampled retain samples at a predefined ratio. Our task-agnostic method achieves an outstanding forget-retain balance, ranking first on leaderboards and significantly outperforming baselines and competing systems.
arXiv.org Artificial Intelligence
Mar-4-2025
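While the full training recipe lives in the paper itself, the two ingredients the abstract names, parameter-efficient LoRA tuning and forget/retain data chunking, admit a compact sketch. The Python below is an illustration under assumptions: the base model (gpt2), the LoRA hyperparameters, the chunk count, and the retain ratio are placeholders, and make_chunks is a hypothetical helper rather than the authors' code.

```python
from itertools import cycle, islice

from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM


def make_chunks(forget_set, retain_set, num_chunks, retain_ratio):
    """Split the forget set into disjoint chunks and pair each chunk with
    retain examples sampled cyclically, at `retain_ratio` retain examples
    per forget example."""
    size = -(-len(forget_set) // num_chunks)  # ceiling division
    retain_cycle = cycle(retain_set)          # retain data is reused across chunks
    chunks = []
    for i in range(num_chunks):
        forget_chunk = forget_set[i * size:(i + 1) * size]
        if not forget_chunk:
            break
        retain_chunk = list(islice(retain_cycle, len(forget_chunk) * retain_ratio))
        chunks.append({"forget": forget_chunk, "retain": retain_chunk})
    return chunks


# Parameter-efficient setup: train only low-rank adapters on the attention
# projections, keeping the base weights frozen. Model and hyperparameters
# here are placeholders, not the authors' settings.
model = AutoModelForCausalLM.from_pretrained("gpt2")
lora_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                         target_modules=["c_attn"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)

# Unlearning would then iterate over the chunks, e.g. taking gradient-ascent
# steps on each "forget" batch while fine-tuning normally on the interleaved
# "retain" batches, so that general capabilities are preserved.
chunks = make_chunks(forget_set=[f"f{i}" for i in range(100)],
                     retain_set=[f"r{i}" for i in range(40)],
                     num_chunks=4, retain_ratio=3)
print([(len(c["forget"]), len(c["retain"])) for c in chunks])
```

One design point worth noting: cycling the retain set lets even a small retain pool cover every chunk at the fixed ratio, so a retain signal accompanies every unlearning step.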