Mechanistic interpretability for steering vision-language-action models

Open in new window