Collaborating with language models for embodied reasoning