Efficient GPT Model Pre-training using Tensor Train Matrix Representation
