Spectral Editing of Activations for Large Language Model Alignment

Open in new window