Multimodal Reranking for Knowledge-Intensive Visual Question Answering