WikiDO: ANewBenchmarkEvaluating Cross-ModalRetrievalforVision-LanguageModels