Transformer Training Strategies for Forecasting Multiple Load Time Series