Transcoders Find Interpretable LLM Feature Circuits
–Neural Information Processing Systems
A key goal in mechanistic interpretability is circuit analysis: finding sparse sub-graphs of models corresponding to specific behaviors or capabilities.
Neural Information Processing Systems
Nov-15-2025, 07:16:57 GMT
- Country:
- Europe > Monaco (0.04)
- North America > United States
- Connecticut > New Haven County > New Haven (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Research Report
- Technology: