Improving Automatic Parallel Training via Balanced Memory Workload Optimization

Open in new window