Improving speech translation by fusing speech and text