Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
