LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning

Open in new window