Exploring the generalization of LLM truth directions on conversational formats

Open in new window