Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs

Open in new window