Unsupervised Alignment of Natural Language Instructions with Video Segments
