Evaluating Sparse Autoencoders for Monosemantic Representation

Open in new window