Show and Guide: Instructional-Plan Grounded Vision and Language Model

Open in new window