HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes

Neural Information Processing Systems 

We automatically annotate the aligned motions with language descriptions that depict the action and the unique interacting objects in the scene; e.g ., sit on the armchair near the desk. HUMANISE thus enables a new generation task, language-conditioned human motion generation in 3D scenes . The proposed task is challenging as it requires joint modeling of the 3D scene, human motion, and natural language.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found