Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning