Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment
