Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
