InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

Open in new window