Representation Noising: A Defence Mechanism Against Harmful Finetuning Jan Wehner 2 Kai Williams 3

Open in new window