Learning Physical Graph Representations from Visual Scenes