Deep Transfer Learning for NLP with Transformers