Detection-based Intermediate Supervision for Visual Question Answering

Open in new window