Representation Engineering: A Top-Down Approach to AI Transparency

Open in new window