End-to-End Speech-to-Text Translation: A Survey