Bench LanguageBenchmark
–Neural Information Processing Systems
Wefurther evaluated state-of-the-art models on this benchmark forthree vision-language tasks: image captioning, visual grounding, and visual question answering. Our work aims to significantly contribute to the development ofadvanced vision-language models inthefieldofremote sensing.
Neural Information Processing Systems
Feb-7-2026, 09:45:35 GMT