Putting People in Their Place: Affordance-Aware Human Insertion into Scenes

Kulal, Sumith, Brooks, Tim, Aiken, Alex, Wu, Jiajun, Yang, Jimei, Lu, Jingwan, Efros, Alexei A., Singh, Krishna Kumar

Apr-27-2023–arXiv.org Artificial Intelligence

We study the problem of inferring scene affordances by presenting a method for realistically inserting people into scenes. Given a scene image with a marked region and an image of a person, we insert the person into the scene while respecting the scene affordances. Our model can infer the set of realistic poses given the scene context, re-pose the reference person, and harmonize the composition. We set up the task in a self-supervised fashion by learning to re-pose humans in video clips. We train a large-scale diffusion model on a dataset of 2.4M video clips that produces diverse plausible poses while respecting the scene context. Given the learned human-scene composition, our model can also hallucinate realistic people and scenes when prompted without conditioning and also enables interactive editing. A quantitative evaluation shows that our method synthesizes more realistic human appearance and more natural human-scene interactions than prior work.

artificial intelligence, machine learning, reference person, (14 more...)

arXiv.org Artificial Intelligence

Apr-27-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.64)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning (0.31)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found