Evaluating Transformer's Ability to Learn Mildly Context-Sensitive Languages

Open in new window