The Prospect of Enhancing Large-Scale Heterogeneous Federated Learning with Transformers