Multi-Attribute Steering of Language Models via Targeted Intervention

Open in new window