A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input

Open in new window