ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities

Open in new window