InstructSAM: ATraining-Free Framework for Instruction-Oriented Remote Sensing Object Recognition
–Neural Information Processing Systems
Language-guided object recognition in remote sensing imagery is crucial for largescale mapping and automated data annotation. However, existing open-vocabulary and visual grounding methods rely on explicit category cues, limiting their ability to handle complex or implicit queries that require advanced reasoning. To address this issue, we introduce a new suite of tasks, including Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS), covering open-vocabulary, open-ended, and open-subclass scenarios.
Neural Information Processing Systems
Jun-20-2026, 21:23:07 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Machine Learning > Neural Networks (0.68)
- Natural Language
- Large Language Model (0.48)
- Text Processing (0.46)
- Information Technology > Artificial Intelligence