SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs
