Evaluating Transformer's Ability to Learn Mildly Context-Sensitive Languages