Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks