EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
