Circuit Stability Characterizes Language Model Generalization

Open in new window