The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models
