PixelVLA: Advancing Pixel-level Understanding in Vision-Language-Action Model

Open in new window