Image2Struct: Benchmarking Structure Extraction for Vision-Language Models
–Neural Information Processing Systems
We introduce Image2Struct, a benchmark to evaluate vision-language models (VLMs) on extracting structure from images.
Neural Information Processing Systems
Nov-20-2025, 04:26:34 GMT
- Country:
- Asia > Japan (0.04)
- Europe > United Kingdom (0.04)
- North America
- Montserrat (0.04)
- United States > California
- Santa Clara County > Palo Alto (0.05)
- Oceania > Australia (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Government (0.67)
- Information Technology (0.67)
- Law > Intellectual Property & Technology Law (0.46)
- Technology: