Emergent Stack Representations in Modeling Counter Languages Using Transformers
