Text-driven Prompt Generation for Vision-Language Models in Federated Learning