scene graph
Country:
- North America > United States (0.14)
- Asia > Singapore (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
Genre:
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.92)
Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.67)
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models
Vision Language Models (VLMs) have demonstrated remarkable performance in 2D vision and language tasks. However, their ability to reason about spatial arrangements remains limited. In this work, we introduce Spatial Region GPT (SpatialRGPT) to enhance VLMs' spatial perception and reasoning capabilities.
Country:
- South America > Brazil (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- Europe > France > Bourgogne-Franche-Comté > Doubs > Besançon (0.04)
Genre:
- Research Report > New Finding (0.93)
- Research Report > Experimental Study (0.93)
Technology:
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.84)
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image Yu Zhao
In the visual spatial understanding (VSU) area, spatial image-to-text (SI2T) and spatial text-to-image (ST2I) are two fundamental tasks that appear in dual form. Existing methods for standalone SI2T or ST2I perform imperfectly in spatial understanding, due to the difficulty of 3D-wise spatial feature modeling.
Country:
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Country:
- Asia > Middle East > Israel (0.04)
- Asia > China > Hong Kong (0.04)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Country:
- Europe > Netherlands > North Brabant > Eindhoven (0.04)
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
Genre:
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
Country:
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
- Oceania > Australia > New South Wales > Sydney (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- North America > Canada > Quebec > Montreal (0.04)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Country:
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)