Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation

Open in new window