Lumos: Empowering Multimodal LLMs with Scene Text Recognition
