Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions