Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark