Understanding and Bridging the Modality Gap for Speech Translation