Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques

Open in new window