Breaking MLPerf Training: A Case Study on Optimizing BERT

Open in new window