Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts

Raymond Yeh, Jinjun Xiong, Wen-Mei Hwu, Minh Do, Alexander Schwing

Neural Information Processing Systems

Textual grounding is an important but challenging task for human-computer interaction, robotics and knowledge mining.