Mechanistic Interpretability in the Presence of Architectural Obfuscation

Open in new window