Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?

Open in new window