Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation