Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs