How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?