Two-dimensional Sparse Parallelism for Large Scale Deep Learning Recommendation Model Training

Open in new window