HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
–Neural Information Processing Systems
Human image animation involves generating videos from a character photo, allowing user control and unlocking the potential for video and movie production. While recent approaches yield impressive results using high-quality training data, the inaccessibility of these datasets hampers fair and transparent benchmarking. Moreover, these approaches prioritize 2D human motion and overlook the significance of camera motion in videos, leading to limited control and unstable video generation. To demystify the training data, we present HumanVid, the first large-scale high-quality dataset tailored for human image animation, which combines crafted real-world and synthetic data. For the real-world data, we compile a large collection of videos from the internet.
May-28-2025, 18:58:44 GMT
- Country:
- Asia > China (0.14)
- Europe (0.28)
- North America > United States (0.14)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Media
- Film (0.52)
- Photography (0.52)
- Television (0.52)
- Technology: