Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions

Open in new window