Characterizing Intrinsic Compositionality in Transformers with Tree Projections

Open in new window