HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Open in new window