Accessing Higher-level Representations in Sequential Transformers with Feedback Memory

Open in new window