Video Diffusion Models are Training-free Motion Interpreter and Controller

Neural Information Processing Systems 

Video generation primarily aims to model authentic and customized motion across frames, making understanding and controlling the motion a crucial topic.