Causal Language Control in Multilingual Transformers via Sparse Feature Steering

Open in new window