Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
