Understanding Different Design Choices in Training Large Time Series Models