SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation

Neural Information Processing Systems 

Large multimodal models (LMMs) have proven flexible and generalisable across many tasks and fields. Although they have strong potential to aid scientific research, their capabilities in this domain are not well characterised. A key aspect of scientific research is the ability to understand and interpret figures, which serve as a rich, compressed source of complex information.