Sneaking Syntax into Transformer Language Models with Tree Regularization
