Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics

Open in new window