Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs

Open in new window