Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation