Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings