Latency-aware Multimodal Federated Learning over UAV Networks