Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers
