Compositional Foundation Models for Hierarchical Planning

Neural Information Processing Systems 

Generated video plans are then grounded to visual-motor control, through an inverse dynamics model that infers actions from generated videos.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found