A multilevel approach to accelerate the training of Transformers