Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders

Open in new window