Probing a Vision-Language-Action Model for Symbolic States and Integration into a Cognitive Architecture