High-Order Attention Models for Visual Question Answering