Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision
