BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Open in new window