Scaling Training of HuggingFace Transformers With Determined