Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering