Spectral Editing of Activations for Large Language Model Alignment

Neural Information Processing Systems 

We also extend our method to non-linear editing using feature functions.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found