Generalizable Memory-driven Transformer for Multivariate Long Sequence Time-series Forecasting