SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning

Xu, Tianyang, Liu, Xiaoze, Wu, Feijie, Wang, Xiaoqian, Gao, Jing

arXiv.org Artificial Intelligence 

Since DPO may hinder the LLM's performance in other unrelated tasks, we integrate gradient projection and Fisher information regularization to mitigate the degradation.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found