Scalable Transformer for PDE Surrogate Modeling