Experiment with Billion-Parameter Models Faster using DeepSpeed and Meta Tensors

Open in new window