Compositional Transformers for Scene Generation