Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation