PaSE: Parallelization Strategies for Efficient DNN Training