Representation Noising: A Defence Mechanism Against Harmful Finetuning Jan Wehner 2 Kai Williams 3