ContextVLA: Vision-Language-Action Model with Amortized Multi-Frame Context

Open in new window