The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
