Emergent Modularity in Pre-trained Transformers