Towards LLM Guardrails via Sparse Representation Steering
