Geometry-aware 4D Video Generation for Robot Manipulation