Functional Faithfulness in the Wild: Circuit Discovery with Differentiable Computation Graph Pruning
