Communication-minimizing Asynchronous Tensor Parallelism

Open in new window