Optimizing I/O for GPU performance tuning of deep learning training in Amazon SageMaker
GPUs can significantly speed up deep learning training, and have the potential to reduce training time from weeks to just hours. Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning (ML) models at any scale. In this post, we focus on general techniques for improving I/O to optimize GPU performance when training on Amazon SageMaker, regardless of the underlying infrastructure or deep learning framework. You can typically see performance improvements up to 10-fold in overall GPU training by just optimizing I/O processing routines. A single GPU can perform tera floating point operations per second (TFLOPS), which allows them to perform operations 10–1,000 times faster than CPUs.
Jul-9-2020, 19:11:45 GMT
- Technology: