Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
