AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking
Premptis, Iraklis, Lymperaiou, Maria, Filandrianos, Giorgos, Mastromichalakis, Orfeas Menis, Voulodimos, Athanasios, Stamou, Giorgos
The Unlearning Sensitive Content from Large Language Models task aims to remove targeted data points from trained models while minimally affecting their general knowledge. In our work, we leverage parameter-efficient, gradient-based unlearning using low-rank adaptation (LoRA) and layer-focused fine-tuning. To further enhance unlearning effectiveness, we employ data chunking: we split the forget data into disjoint partitions and merge each with cyclically sampled retain samples at a predefined ratio. Our task-agnostic method achieves an outstanding forget-retain balance, ranking first on leaderboards and significantly outperforming baselines and competing systems.
arXiv.org Artificial Intelligence
Mar-4-2025
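While the full training recipe lives in the paper itself, the two ingredients the abstract names, parameter-efficient LoRA tuning and forget/retain data chunking, admit a compact sketch. The Python below is an illustration under assumptions: the base model (gpt2), the LoRA hyperparameters, the chunk count, and the retain ratio are placeholders, and make_chunks is a hypothetical helper rather than the authors' code.

```python
from itertools import cycle, islice

from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM


def make_chunks(forget_set, retain_set, num_chunks, retain_ratio):
    """Split the forget set into disjoint chunks and pair each chunk with
    retain examples sampled cyclically, at `retain_ratio` retain examples
    per forget example."""
    size = -(-len(forget_set) // num_chunks)  # ceiling division
    retain_cycle = cycle(retain_set)          # retain data is reused across chunks
    chunks = []
    for i in range(num_chunks):
        forget_chunk = forget_set[i * size:(i + 1) * size]
        if not forget_chunk:
            break
        retain_chunk = list(islice(retain_cycle, len(forget_chunk) * retain_ratio))
        chunks.append({"forget": forget_chunk, "retain": retain_chunk})
    return chunks


# Parameter-efficient setup: train only low-rank adapters on the attention
# projections, keeping the base weights frozen. Model and hyperparameters
# here are placeholders, not the authors' settings.
model = AutoModelForCausalLM.from_pretrained("gpt2")
lora_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                         target_modules=["c_attn"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)

# Unlearning would then iterate over the chunks, e.g. taking gradient-ascent
# steps on each "forget" batch while fine-tuning normally on the interleaved
# "retain" batches, so that general capabilities are preserved.
chunks = make_chunks(forget_set=[f"f{i}" for i in range(100)],
                     retain_set=[f"r{i}" for i in range(40)],
                     num_chunks=4, retain_ratio=3)
print([(len(c["forget"]), len(c["retain"])) for c in chunks])
```

One design point worth noting: cycling the retain set lets even a small retain pool cover every chunk at the fixed ratio, so a retain signal accompanies every unlearning step.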