Position Paper: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Open in new window