Multimodal Procedural Planning via Dual Text-Image Prompting