Beyond Static Perception: Integrating Temporal Context into VLMs for Cloth Folding
