Image2Struct: Benchmarking Structure Extraction for Vision-Language Models Tony Lee

Open in new window