Grounding Classical Task Planners via Vision-Language Models