Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning

Open in new window