Which Attention Heads Matter for In-Context Learning?

Open in new window