The Future of Large Language Model Pre-training is Federated