Blocksworld Revisited: Learning and Reasoning to Generate Event-Sequences from Image Pairs