Steering Language Models with Weight Arithmetic

Open in new window