Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
